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(54) Antigen library immunization 

(57) This invention is directed to antigen library im- 
munization, which provides methods for obtaining anti- 
gens having improved properties for therapeutic and 
other uses. The methods are useful for obtaining im- 



proved antigens that can induce an immune response 
against pathogens, cancer, and other conditions, as well 
as antigens that are effective in modulating allergy, in- 
flammatory and autoimmune diseases. 
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Description 

BACKGROUND OF THE INVENTION 
5 Field of the Invention 

[0001] This invention pertains to the field of methods for developing immunogens that can induce efficient immune 
responses against a broad range of antigens. 

10 Background 

[0002] The interactions between pathogens and hosts are results of millions of years of evolution, during which the 
mammalian immune system has evolved sophisticated means to counterattack pathogen invasions. However, bacterial 
and viral pathogens have simultaneously gained a number of mechanisms to improve their virulence and survival in 

15 hosts, providing a major challenge for vaccine research and development despite the powers of modem techniques 
of molecular and cellular biology. Similar to the evolution of pathogen antigens, several cancer antigens are likely to 
have gained means to downregulate their immunogenicity as a mechanism to escape the host immune system. 
[0003] Efficient vaccine development is also hampered by the antigenic heterogeneity of different strains of patho- 
gens, driven in part by evolutionary forces as means for the pathogens to escape immune defenses. Pathogens also 

20 reduce their immunogenicity by selecting antigens that are difficult to express, process and/or transport in host cells, 
thereby reducing the availability of immunogenic peptides to the molecules initiating and modulating immune respons- 
es. The mechanisms associated with these challenges are complex, multivariate and rather poorly characterized. Ac- 
cordingly, a need exists for vaccines that can induce a protective immune response against bacterial and viral patho- 
gens. The present invention fulfills this and other needs. 

25 

SUMMARY OF THE INVENTION 

[0004] The present invention provides recombinant multivalent antigenic polypeptides that include a first antigenic 
determinant from a first disease-associated polypeptide and at least a second antigenic determinant from a second 
30 disease-associated polypeptide. The disease-associated polypeptides can be selected from the group consisting of 
cancer antigens, antigens associated with autoimmunity disorders, antigens associated with inflammatory conditions, 
antigens associated with allergic reactions, antigens associated with infectious agents, and other antigens that are 
associated with a disease condition. 

[0005] In another embodiment, the invention provides a recombinant antigen library that contains recombinant nu- 
35 oleic acids that encode antigenic polypeptides. The libraries are typically obtained by recombining at least first and 
second forms of a nucleic acid which includes a polynucleotide sequence that encodes a disease-associated antigenic 
polypeptide, wherein the first and second forms differ from each other in two or more nucleotides, to produce a library 
of recombinant nucleic acids. 

[0006] Another embodiment of the invention provides methods of obtaining a polynucleotide that encodes a recom- 

40 binant antigen having improved ability to induce an immune response to a disease condition. These methods involve: 
(1) recombining at least first and second forms of a nucleic acid which comprises a polynucleotide sequence that 
encodes an antigenic polypeptide that is associated with the disease condition, wherein the first and second forms 
differ from each other in two or more nucleotides, to produce a library of recombinant nucleic acids; and (2) screening 
the library to identify at least one optimized recombinant nucleic acid that encodes an optimized recombinant antigenic 

45 polypeptide that has improved ability to induce an immune response to the disease condition. 

[0007] These methods optionally further involve: (3) recombining at least one optimized recombinant nucleic acid 
with a further form of the nucleic acid, which is the same or different from the first and second forms, to produce a 
further library of recombinant nucleic acids; (4) screening the further library to identify at least one further optimized 
recombinant nucleic acid that encodes a polypeptide that has improved ability to induce an immune response to the 

50 disease condition; and (5) repeating (3) and (4), as necessary, until the further optimized recombinant nucleic acid 
encodes a polypeptide that has improved ability to induce an immune response to the disease condition. 
[0008] In some embodiments, the optimized recombinant nucleic acid encodes a multivalent antigenic polypeptide 
and the screening is accomplished by expressing the library of recombinant nucleic acids in a phage display expression 
vector such that the recombinant antigen is expressed as a fusion protein with a phage polypeptide that is displayed 

55 on a phage particle surface; contacting the phage with a first antibody that is specific for a first serotype of the pathogenic 
agent and selecting those phage that bind to the first antibody; and contacting those phage that bind to the first antibody 
with a second antibody that is specific for a second serotype of the pathogenic agent and selecting those phage that 
bind to the second antibody; wherein those phage that bind to the first antibody and the second antibody express a 



2 



EP 1 555 316 A2 



multivalent antigenic polypeptide. 

[0009] The invention also provides methods of obtaining a recombinant viral vector which has an enhanced ability 
to induce an antiviral response in a cell. These methods can include the steps of: (1) recombining at least first and 
second forms of a nucleic acid which comprise a viral vector, wherein the first and second forms differ from each other 
5 in two or more nucleotides, to produce a library of recombinant viral vectors; (2) transfecting the library of recombinant 
viral vectors into a population of mammalian cells; (3) staining the cells for the presence of Mx protein; and (4) isolating 
recombinant viral vectors from cells which stain positive for Mx protein, wherein recombinant viral vectors from positive 
staining cells exhibit enhanced ability to induce an antiviral response. 

10 BRIEF DESCRIPTION OF THE FIGURES 

[0010] 

Figure 1 shows a schematic representation of a method for generating a chimeric, multivalent antigen that has 
15 immunogenic regions from multiple antigens. Antibodies to each of the non-chimeric parental immunogenic 

polypeptides are specific for the respective organisms (A, B, C). After carrying out the recombination and selection 
methods of the invention, however, a chimeric immunogenic polypeptide is obtained that is recognized by antibod- 
ies raised against each of the three parental immunogenic polypeptides. 

Figure 2 shows the principle of family DNA shuffling. A family of antigen genes from related pathogens are subjected 
20 to shuffling, which results in a library of chimeric and/or mutated antigens. Screening methods are employed to 

identify those recombinant antigens that are the most immunogenic and/or cross-protective. These can, if desired, 
be subjected to additional rounds of shuffling and screening. 

Figure 3A-Figure 3B shows a schematic for a method by which one can obtain recombinant polypeptides that can 
induce a broad-spectrum immune response. In Figure 3A, wild-type immunogenic polypeptides from the pathogens 

25 A, B, and C provide protection against the corresponding pathogen from which the polypeptide is derived, but little 

or no cross-protection against the other pathogens (left panel). After shuffling, an A/B/C chimeric polypeptide is 
obtained that can induce a protective immune response against all three pathogen types (right panel). In Figure 
3B, shuffling is used with substrate nucleic acids from two pathogen strains (A, B), which encode polypeptides 
that are protective only against the corresponding pathogen. After shuffling, the resulting chimeric polypeptide can 

30 induce an immune response that is effective against not only the two parental pathogen strains, but also against 

a third strain of pathogen (C). 

Figure 4 diagrams some of the possible factors that can determine whether a particular polynucleotide encodes 
an immunogenic polypeptide having a desired property, such as enhanced immunogenicity and/or cross-reactivity. 
Those sequence regions that positively affect a particular property are indicated as plus signs along the antigen 
35 gene, while those sequence regions that have a negative effect are shown as minus signs. A pool of related antigen 

genes are shuffled and screened to obtain those that recombinant nucleic acids that have gained positive sequence 
regions and lost negative regions. No pre-existing knowledge as to which regions are positive or negative for a 
particular trait is required. 

Figure 5 shows a schematic representation of the screening strategy for antigen library screening. 
40 Figure 6 shows a schematic representation of a strategy for pooling and deconvolution as used in antigen library 

screening. 

Figure 7 is an alignment of the nucleotide sequences of glycoprotein D (gD) from HSV-1 (SEQ ID NO:1 ) and HSV- 
2 (gD-1 (SEQ ID NO:2) and gD-2 (SEQ ID NO:3)). 

Figure 8A shows a diagram of a method for expressing HIV gp1 20 using genetic vaccine vectors and generation 
45 of a library of shuffled gp1 20 genes. Figure 8B shows PCR primers that are useful for obtaining gp1 20 nucleic acid 

substrates for DNA shuffling reactions. Primers suitable for generating substrates include 6025F (SEQ ID NO:4), 

7773R (SEQ ID NO:5), and primers suitable for amplifying the shuffled nucleic acids include 61 96F (SEQ ID NO: 

6) and 7746R (SEQ ID NO:7). The primer BssH2-6205F (SEQ ID NO:8) can be used to clone the resulting fragment 

into a genetic vaccine vector. 
50 Figure 9 shows the domain structure of hepadnavirus envelope genes. 

Figure 10 shows a schematic representation of the use of shuffling to obtain hepadnavirus proteins in which the 

immunogenicity of one antigenic domain is improved. 

Figure 11 shows a strategy in which genes that encode the hepadnavirus proteins having one antigenic domain 
that has improved immunogenicity are shuffled to obtain recombinant proteins in which all three domains have 
55 improved immunogenicity. 

Figure 12 shows the transmembrane organization of the HBsAg polypeptide. 

Figure 13 shows a method for using phage display to obtain recombinant allergens that are not bound by pre- 
existing IgE. 
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Figure 14 shows a strategy for screening of recombinant allergens to identify those that are effective in activating 
T H cells. PBMC or T cell clones from atopic individuals are exposed to antigen-presenting cells that display the 
antigen variants obtained using the methods of the invention. To identify those allergen variants that are effective 
in activating T cells, the cultures are tested for induction of T cell proliferation or for a pattern of cytokine synthesis 
5 that is indicative of the particular type of T cell activation that is desired. If desired, the allergen variants that test 

positive in the in vitro assay can be subjected to in vivo testing. 

Figure 15 shows a strategy for screening of recombinant cancer antigens to identify those that are effective in 
activating T cells of cancer patients. 

Figure 1 6A and Figure 1 6B show two different strategies for generating vectors that contain multiple T cell epitopes 
10 obtained, for example, by DNA shuffling. In Figure 16A, each individual shuffled epitope-encoding nucleic acid is 

linked to a single promoter, and multiple promoter-epitope gene constructs can be placed in a single vector. The 
scheme shown in Figure 16B involves linking multiple epitope-encoding nucleic acids to a single promoter. 
Figure 17 shows the sequences of PreS2-S coding regions (SEQ ID NOS:9 and 11) and corresponding amino 
acid sequences (SEO ID NOS:1 0 and 1 2) of different hepatitis B surface antigen (HBsAg) or woodchuck hepatitis 
15 B (WHV) proteins (SEQ ID NOS:15 and 1 6). Primers suitable for amplification of this region are also shown (HBV, 

SEQ ID NOS:13 and 14; WHV, SEQ ID NOS:17 and 18). 

Figure 18 shows primers (SEQ ID NOS:19-22) that are suitable for amplification of large fragments that contain 
the S2S coding sequences. The primers hybridize to regions that are approximately 200 bp outside the desired 
sequences. 

20 Figure 1 9 shows an alignment of the amino acid sequences of surface antigens from different HVB subtypes (SEQ 

ID NOS:10 and 12). 

Figure 20 shows a diagram of multimeric particles that assemble when an appropriate number of chimeric polypep- 
tides and native HBsAg S monomers are mixed. 

25 DETAILED DESCRIPTION 

Definitions 

[0011] The term "screening" describes, in general, a process that identifies optimal antigens. Several properties of 

30 the antigen can be used in selection and screening including antigen expression, folding, stability, immunogenicity and 
presence of epitopes from several related antigens. Selection is a form of screening in which identification and physical 
separation are achieved simultaneously by expression of a selection marker, which, in some genetic circumstances, 
allows cells expressing the marker to survive while other cells die (or vice versa). Screening markers include, for 
example, luciferase, beta-galactosidase and green fluorescent protein. Selection markers include drug and toxin re- 

35 sistance genes, and the like. Because of limitations in studying primary immune responses in vitro, in vivo studies are 
particularly useful screening methods. In these studies, the antigens are first introduced to test animals, and the immune 
responses are subsequently studied by analyzing protective immune responses or by studying the quality or strength 
of the induced immune response using lymphoid cells derived from the immunized animal. Although spontaneous 
selection can and does occur in the course of natural evolution, in the present methods selection is performed by man. 

40 [0012] A "exogenous DNA segment", "heterologous sequence" or a "heterologous nucleic acid", as used herein, is 
one that originates from a source foreign to the particular host cell, or, if from the same source, is modified from its 
original form. Thus, a heterologous gene in a host cell includes a gene that is endogenous to the particular host cell, 
but has been modified. Modification of a heterologous sequence in the applications described herein typically occurs 
through the use of DNA shuffling. Thus, the terms refer to a DNA segment which is foreign or heterologous to the cell, 

45 or homologous to the cell but in a position within the host cell nucleic acid in which the element is not ordinarily found. 
Exogenous DNA segments are expressed to yield exogenous polypeptides. 

[0013] The term "gene" is used broadly to refer to any segment of DNA associated with a biological function. Thus, 
genes include coding sequences and/or the regulatory sequences required for their expression. Genes also include 
nonexpressed DNA segments that, for example, form recognition sequences for other proteins. Genes can be obtained 
50 from a variety of sources, including cloning from a source of interest or synthesizing from known or predicted sequence 
information, and may include sequences designed to have desired parameters. 

[0014] The term "isolated", when applied to a nucleic acid or protein, denotes that the nucleic acid or protein is 
essentially free of other cellular components with which it is associated in the natural state. It is preferably in a homo- 
geneous state although it can be in either a dry or aqueous solution. Purity and homogeneity are typically determined 
55 using analytical chemistry techniques such as polyacrylamide gel electrophoresis or high performance liquid chroma- 
tography. A protein which is the predominant species present in a preparation is substantially purified. In particular, an 
isolated gene is separated from open reading frames which flank the gene and encode a protein other than the gene 
of interest. The term "purified" denotes that a nucleic acid or protein gives rise to essentially one band in an electro- 
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phoretic gel. Particularly, it means that the nucleic acid or protein is at least about 50% pure, more preferably at least 
about 85% pure, and most preferably at least about 99% pure. 

[0015] The term "naturally-occurring" is used to describe an object that can be found in nature as distinct from being 
artificially produced by man. For example, a polypeptide or polynucleotide sequence that is present in an organism 
5 (including viruses, bacteria, protozoa, insects, plants or mammalian tissue) that can be isolated from a source in nature 
and which has not been intentionally modified by man in the laboratory is naturally-occurring. 

[001 6] The term "nucleic acid" refers to deoxyribonucleotides or ribonucleotides and polymers thereof in either single- 
or double-stranded form. Unless specifically limited, the term encompasses nucleic acids containing known analogues 
of natural nucleotides which have similar binding properties as the reference nucleic acid and are metabolized in a 

10 manner similar to naturally occurring nucleotides. Unless otherwise indicated, a particular nucleic acid sequence also 
implicitly encompasses conservatively modified variants thereof (e.g. degenerate codon substitutions) and comple- 
mentary sequences and as well as the sequence explicitly indicated. Specifically, degenerate codon substitutions may 
be achieved by generating sequences in which the third position of one or more selected (or all) codons is substituted 
with mixed-base and/or deoxyinosine residues (Batzer etal. (1991) Nucleic Acid Res. 19: 5081; Ohtsukaef a/. (1985) 

15 J. Biol. Chem. 260:2605-2608; Cassoletal. (1992); Rossolini etal. (1994) Mol. Cell. Probes 8: 91 -98). The term nucleic 
acid is used interchangeably with gene, cDNA, and mRNA encoded by a gene. 

[0017] "Nucleic acid derived from a gene" refers to a nucleic acid for whose synthesis the gene, or a subsequence 
thereof, has ultimately served as a template. Thus, an mRNA, a cDNA reverse transcribed from an mRNA, an RNA 
transcribed from that cDNA, a DNA amplified from the cDNA, an RNA transcribed from the amplified DNA, etc., are all 
20 derived from the gene and detection of such derived products is indicative of the presence and/or abundance of the 
original gene and/or gene transcript in a sample. 

[0018] A nucleic acid is "operably linked" when it is placed into a functional relationship with another nucleic acid 
sequence. For instance, a promoter or enhancer is operably linked to a coding sequence if it increases the transcription 
of the coding sequence. Operably linked means that the DNA sequences being linked are typically contiguous and, 
25 where necessary to join two protein coding regions, contiguous and in reading frame. However, since enhancers gen- 
erally function when separated from the promoter by several kilobases and intronic sequences may be of variable 
lengths, some polynucleotide elements may be operably linked but not contiguous. 

[0019] A specific binding affinity between two molecules, for example, a ligand and a receptor, means a preferential 
binding of one molecule for another in a mixture of molecules. The binding of the molecules can be considered specific 

30 if the binding affinity is about 1 x 1 0 4 M" 1 to about 1 x 1 0 6 M" 1 or greater. 

[0020] The term "recombinant" when used with reference to a cell indicates that the cell replicates a heterologous 
nucleic acid, or expresses a peptide or protein encoded by a heterologous nucleic acid. Recombinant cells can contain 
genes that are not found within the native (non-recombinant) form of the cell. Recombinant cells can also contain genes 
found in the native form of the cell wherein the genes are modified and re-introduced into the cell by artificial means. 

35 The term also encompasses cells that contain a nucleic acid endogenous to the cell that has been modified without 
removing the nucleic acid from the cell; such modifications include those obtained by gene replacement, site-specific 
mutation, and related techniques. 

[0021 ] A "recombinant expression cassette" or simply an "expression cassette" is a nucleic acid construct, generated 
recombinantly or synthetically, with nucleic acid elements that are capable of effecting expression of a structural gene 

40 in hosts compatible with such sequences. Expression cassettes include at least promoters and optionally, transcription 
termination signals. Typically, the recombinant expression cassette includes a nucleic acid to be transcribed (e.g., a 
nucleic acid encoding a desired polypeptide), and a promoter. Additional factors necessary or helpful in effecting ex- 
pression may also be used as described herein. For example, an expression cassette can also include nucleotide 
sequences that encode a signal sequence that directs secretion of an expressed protein from the host cell. Transcription 

45 termination signals, enhancers, and other nucleic acid sequences that influence gene expression, can also be included 
in an expression cassette. 

[0022] A "multivalent antigenic polypeptide" or a "recombinant multivalent antigenic polypeptide" is a non-naturally 
occurring polypeptide that includes amino acid sequences from more than one source polypeptide, which source 
polypeptide is typically a naturally occurring polypeptide. At least some of the regions of different amino acid sequences 
50 constitute epitopes that are recognized by antibodies found in a mammal that has been injected with the source polypep- 
tide. The source polypeptides from which the different epitopes are derived are usually homologous (i.e., have the 
same or a similar structure and/or function), and are often from different isolates, serotypes, strains, species, of organ- 
ism or from different disease states, for example. 

[0023] The terms "identical" or percent "identity," in the context of two or more nucleic acid or polypeptide sequences, 
55 refer to two or more sequences or subsequences that are the same or have a specified percentage of amino acid 
residues or nucleotides that are the same, when compared and aligned for maximum correspondence, as measured 
using one of the following sequence comparison algorithms or by visual inspection. 

[0024] The phrase "substantially identical," in the context of two nucleic acids or polypeptides, refers to two or more 
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sequences or subsequences that have at least 60%, preferably 80%, most preferably 90-95% nucleotide or amino acid 
residue identity, when compared and aligned for maximum correspondence, as measured using one of the following 
sequence comparison algorithms or by visual inspection. Preferably, the substantial identity exists over a region of the 
sequences that is at least about 50 residues in length, more preferably over a region of at least about 100 residues, 
and most preferably the sequences are substantially identical over at least about 1 50 residues. In some embodiments, 
the sequences are substantially identical over the entire length of the coding regions. 

[0025] For sequence comparison, typically one sequence acts as a reference sequence to which test sequences 
are compared. When using a sequence comparison algorithm, test and reference sequences are input into a computer, 
subsequence coordinates are designated, if necessary, and sequence algorithm program parameters are designated. 
The sequence comparison algorithm then calculates the percent sequence identity for the test sequence(s) relative to 
the reference sequence, based on the designated program parameters. 

[0026] Optimal alignment of sequences for comparison can be conducted, e.g., by the local homology algorithm of 
Smith & Waterman, Adv. Appl. Math. 2:482 (1981), by the homology alignment algorithm ofNeedleman & Wunsch, J. 
Mol. Biol. 48:443 (1 970), by the search for similarity method of Pearson & Lipman, Proc. Nat'l. Acad. Sci. USA 85:2444 
(1988), by computerized implementations of these algorithms (GAP, BESTFIT, PASTA, and TFASTA in the Wisconsin 
Genetics Software Package, Genetics Computer Group, 575 Science Dr., Madison, Wl), or by visual inspection (see 
generally Ausubel et ai, Infra). 

[0027] One example of an algorithm that is suitable for determining percent sequence identity and sequence similarity 
is the BLAST algorithm, which is described in Altschul et al., J. Mol. Biol. 21 5:403-41 0 (1 990). Software for performing 
BLAST analyses is publicly available through the National Center for Biotechnology Information (http://www.ncbi.nlm. 
nih.gov/. This algorithm involves first identifying high scoring sequence pairs (HSPs) by identifying short words of length 
W in the query sequence, which either match or satisfy some positive-valued threshold score T when aligned with a 
word of the same length in a database sequence. T is referred to as the neighborhood word score threshold (Altschul 
et ai., supra). These initial neighborhood word hits act as seeds for initiating searches to find longer HSPs containing 
them. The word hits are then extended in both directions along each sequence for as far as the cumulative alignment 
score can be increased. Cumulative scores are calculated using, for nucleotide sequences, the parameters M (reward 
score for a pair of matching residues; always > 0) and N (penalty score for mismatching residues; always < 0). For 
amino acid sequences, a scoring matrix is used to calculate the cumulative score. Extension of the word hits in each 
direction are halted when: the cumulative alignment score falls off by the quantity X from its maximum achieved value; 
the cumulative score goes to zero or below, due to the accumulation of one or more negative-scoring residue align- 
ments; or the end of either sequence is reached. The BLAST algorithm parameters W, T, and X determine the sensitivity 
and speed of the alignment. The BLASTN program (for nucleotide sequences) uses as defaults a wordlength (W) of 
1 1 , an expectation (E) of 1 0, a cutoff of 1 00, M=5, N=-4, and a comparison of both strands. For amino acid sequences, 
the BLASTP program uses as defaults a wordlength (W) of 3, an expectation (E) of 10, and the BLOSUM62 scoring 
matrix (see Henikoff & Henikoff (1989) Proc. Natl. Acad. Sci. USA 89:10915). 

[0028] In addition to calculating percent sequence identity, the BLAST algorithm also performs a statistical analysis 
of the similarity between two sequences (see, e.g., Karlin & Altschul (1 993) Proc. Nat'l. Acad. Sci. USA 90:5873-5787). 
One measure of similarity provided by the BLAST algorithm is the smallest sum probability (P(N)), which provides an 
indication of the probability by which a match between two nucleotide or amino acid sequences would occur by chance. 
For example, a nucleic acid is considered similar to a reference sequence if the smallest sum probability in a comparison 
of the test nucleic acid to the reference nucleic acid is less than about 0.1 , more preferably less than about 0.01 , and 
most preferably less than about 0.001 . 

[0029] Another indication that two nucleic acid sequences are substantially identical is that the two molecules hy- 
bridize to each other under stringent conditions. The phrase "hybridizing specifically to", refers to the binding, duplexing, 
or hybridizing of a molecule only to a particular nucleotide sequence under stringent conditions when that sequence 
is present in a complex mixture (e.g., total cellular) DNA or RNA. "Bind(s) substantially" refers to complementary hy- 
bridization between a probe nucleic acid and a target nucleic acid and embraces minor mismatches that can be ac- 
commodated by reducing the stringency of the hybridization media to achieve the desired detection of the target poly- 
nucleotide sequence. 

[0030] "Stringent hybridization conditions" and "stringent hybridization wash conditions" in the context of nucleic acid 
hybridization experiments such as Southern and northern hybridizations are sequence dependent, and are different 
under different environmental parameters. Longer sequences hybridize specifically at higher temperatures. An exten- 
sive guide to the hybridization of nucleic acids is found in Tijssen (1993) Laboratory Techniques in Biochemistry and 
Molecular Biology-Hybridization with Nucleic Acid Probes part I chapter 2 "Overview of principles of hybridization and 
the strategy of nucleic acid probe assays", Elsevier, New York. Generally, highly stringent hybridization and wash 
conditions are selected to be about 5° C lower than the thermal melting point (T m ) for the specific sequence at a defined 
ionic strength and pH. Typically, under "stringent conditions" a probe will hybridize to its target subsequence, but to no 
other sequences. 
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[0031] The T m is the temperature (under defined ionic strength and pH) at which 50% of the target sequence hybrid- 
izes to a perfectly matched probe. Very stringent conditions are selected to be equal to the T m for a particular probe. 
An example of stringent hybridization conditions for hybridization of complementary nucleic acids which have more 
than 100 complementary residues on a filter in a Southern or northern blot is 50% formamide with 1 mg of heparin at 

5 42°C, with the hybridization being carried out overnight. An example of highly stringent wash conditions is 0.15M NaCI 
at 72°C for about 15 minutes. An example of stringent wash conditions is a 0.2x SSC wash at 65°C for 15 minutes 
(see, Sambrook, infra., for a description of SSC buffer). Often, a high stringency wash is preceded by a low stringency 
wash to remove background probe signal. An example medium stringency wash for a duplex of, e.g., more than 100 
nucleotides, is 1x SSC at45°C for 15 minutes. An example low stringency wash for a duplex of, e.g., more than 100 

10 nucleotides, is 4-6x SSC at 40°C for 1 5 minutes. For short probes (e.g., about 1 0 to 50 nucleotides), stringent conditions 
typically involve salt concentrations of less than about 1 .0 M Na + ion, typically about 0.01 to 1 .0 M Na+ ion concentration 
(or other salts) at pH 7.0 to 8.3, and the temperature is typically at least about 30°C. Stringent conditions can also be 
achieved with the addition of destabilizing agents such as formamide. In general, a signal to noise ratio of 2x (or higher) 
than that observed for an unrelated probe in the particular hybridization assay indicates detection of a specific hybrid- 

15 ization. Nucleic acids which do not hybridize to each other under stringent conditions are still substantially identical if 
the polypeptides which they encode are substantially identical. This occurs, e.g., when a copy of a nucleic acid is 
created using the maximum codon degeneracy permitted by the genetic code. 

[0032] A further indication that two nucleic acid sequences or polypeptides are substantially identical is that the 
polypeptide encoded by the first nucleic acid is immunologically cross reactive with, or specifically binds to, the polypep- 
20 tide encoded by the second nucleic acid. Thus, a polypeptide is typically substantially identical to a second polypeptide, 
for example, where the two peptides differ only by conservative substitutions. 

[0033] The phrase "specifically (or selectively) binds to an antibody" or "specifically (or selectively) immunoreactive 
with", when referring to a protein or peptide, refers to a binding reaction which is determinative of the presence of the 
protein, or an epitope from the protein, in the presence of a heterogeneous population of proteins and other biologies. 

25 Thus, under designated immunoassay conditions, the specified antibodies bind to a particular protein and do not bind 
in a significant amount to other proteins present in the sample. The antibodies raised against a multivalent antigenic 
polypeptide will generally bind to the proteins from which one or more of the epitopes were obtained. Specific binding 
to an antibody under such conditions may require an antibody that is selected for its specificity for a particular protein. 
A variety of immunoassay formats may be used to select antibodies specifically immunoreactive with a particular pro- 

30 tein. For example, solid-phase ELISA immunoassays, Western blots, or immunohistochemistry are routinely used to 
select monoclonal antibodies specifically immunoreactive with a protein. See Harlow and Lane (1988) Antibodies, A 
Laboratory Manual, Cold Spring Harbor Publications, New York "Harlow and Lane"), for a description of immunoassay 
formats and conditions that can be used to determine specific immunoreactivity. Typically a specific or selective reaction 
will be at least twice background signal or noise and more typically more than 10 to 100 times background. 

35 [0034] "Conservatively modified variations" of a particular polynucleotide sequence refers to those polynucleotides 
that encode identical or essentially identical amino acid sequences, or where the polynucleotide does not encode an 
amino acid sequence, to essentially identical sequences. Because of the degeneracy of the genetic code, a large 
number of functionally identical nucleic acids encode any given polypeptide. For instance, the codons CGU, CGC, 
CGA, CGG, AGA, and AGG all encode the amino acid arginine. Thus, at every position where an arginine is specified 

40 by a codon, the codon can be altered to any of the corresponding codons described without altering the encoded 
polypeptide. Such nucleic acid variations are "silent variations," which are one species of "conservatively modified 
variations." Every polynucleotide sequence described herein which encodes a polypeptide also describes every pos- 
sible silent variation, except where otherwise noted. One of skill will recognize that each codon in a nucleic acid (except 
AUG, which is ordinarily the only codon for methionine) can be modified to yield a functionally identical molecule by 

45 standard techniques. Accordingly, each "silent variation" of a nucleic acid which encodes a polypeptide is implicit in 
each described sequence. 

[0035] Furthermore, one of skill will recognize that individual substitutions, deletions or additions which alter, add or 
delete a single amino acid or a small percentage of amino acids (typically less than 5%, more typically less than 1%) 
in an encoded sequence are "conservatively modified variations" where the alterations result in the substitution of an 
50 amino acid with a chemically similar amino acid. Conservative substitution tables providing functionally similar amino 
acids are well known in the art. The following five groups each contain amino acids that are conservative substitutions 
for one another: 

Aliphatic: Glycine (G), Alanine (A), Valine (V), Leucine (L), Isoleucine (I); 
55 Aromatic: Phenylalanine (F), Tyrosine (Y), Tryptophan (W); 

Sulfur-containing: Methionine (M), Cysteine (C); 
Basic: Arginine (R), Lysine (K), Histidine (H); 

Acidic: Aspartic acid (D), Glutamic acid (E), Asparagine (N), Glutamine (Q). 
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See also, Creighton (1 984) Proteins, W.H. Freeman and Company, for additional groupings of amino acids. In addition, 
individual substitutions, deletions or additions which alter, add or delete a single amino acid or a small percentage of 
amino acids in an encoded sequence are also "conservatively modified variations". 

[0036] A "subsequence" refers to a sequence of nucleic acids or amino acids that comprise a part of a longer se- 
5 quence of nucleic acids or amino acids (e.g., polypeptide) respectively. 

Description of the Preferred Embodiments 

[0037] The invention provides a new approach to vaccine development, which is termed "antigen library immuniza- 

10 tion." No other technologies are available for generating libraries of related antigens or optimizing known protective 
antigens. The most powerful previously existing methods for identification of vaccine antigens, such as high throughput 
sequencing or expression library immunization, only explore the sequence space provided by the pathogen genome. 
These approaches are likely to be insufficient, because they generally only target single pathogen strains, and because 
natural evolution has directed pathogens to downregulate their own immunogenicity. In contrast, the immunization 

15 protocols of the invention, which use shuffled antigen libraries, provide a means to identify novel antigen sequences. 
Those antigens that are most protective can be selected from these pools by in vivo challenge models. Antigen library 
immunization dramatically expands the diversity of available immunogen sequences, and therefore, these antigen 
chimera libraries can also provide means to defend against newly emerging pathogen variants of the future. The meth- 
ods of the invention enable the identification of individual chimeric antigens that provide efficient protection against a 

20 variety of existing pathogens, providing improved vaccines for troops and civilian populations. 

[0038] The methods of the invention provide an evolution-based approach, such as DNA shuffling in particular, that 
is an optimal approach to improve the immunogenicity of many types of antigens. For example, the methods provide 
means of obtaining optimized cancer antigens useful for preventing and treating malignant diseases. Furthermore, an 
increasing number of self-antigens, causing autoimmune diseases, and allergens, causing atopy, allergy and asthma, 

25 have been characterized. The immunogenicity and manufacturing of these antigens can likewise be improved with the 
methods of this invention. 

[0039] The antigen library immunization methods of the invention provide a means by which one can obtain a re- 
combinant antigen that has improved ability to induce an immune response to a pathogenic agent. A "pathogenic agent" 
refers to an organism or virus that is capable of infecting a host cell. Pathogenic agents typically include and/or encode 

30 a molecule, usually a polypeptide, that is immunogenic in that an immune response is raised against the immunogenic 
polypeptide. Often, the immune response raised against an immunogenic polypeptide from one serotype of the path- 
ogenic agent is not capable of recognizing, and thus protecting against, a different serotype of the pathogenic agent, 
or other related pathogenic agents. In other situations, the polypeptide produced by a pathogenic agent is not produced 
in sufficient amounts, or is not sufficiently immunogenic, for the infected host to raise an effective immune response 

35 against the pathogenic agent. 

[0040] These problems are overcome by the methods of the invention, which typically involve recombining two or 
more forms of a nucleic acid that encode a polypeptide of the pathogenic agent, or antigen involved in another disease 
or condition. These recombination methods, referred to herein as "DNA shuffling", use as substrates forms of the 
nucleic acid that differ from each other in two or more nucleotides, so a library of recombinant nucleic acids results. 

40 The library is then screened to identify at least one optimized recombinant nucleic acid that encodes an optimized 
recombinant antigen that has improved ability to induce an immune response to the pathogenic agent or other condition. 
The resulting recombinant antigens often are chimeric in that they are recognized by antibodies (Abs) reacting against 
multiple pathogen strains, and generally can also elicit broad spectrum immune responses. Specific neutralizing anti- 
bodies are known to mediate protection against several pathogens of interest, although additional mechanisms, such 

45 as cytotoxic T lymphocytes, are likely to be involved. The concept of chimeric, multivalent antigens inducing broadly 
reacting antibody responses is further illustrated in Figure 1. 

[0041] In preferred embodiments, the different forms of the nucleic acids that encode antigenic polypeptides are 
obtained from members of a family of related pathogenic agents. This scheme of performing DNA shuffling using 
nucleic acids from related organisms, known as "family shuffling," is described in Crameri et al. ((1998) Nature 391 : 

50 288-291 ) and is shown schematically in Figure 2. Polypeptides of different strains and serotypes of pathogens generally 
vary between 60-98%, which will allow for efficient family DNA shuffling. Therefore, family DNA shuffling provides an 
effective approach to generate multivalent, crossprotective antigens. The methods are useful for obtaining individual 
chimeras that effectively protect against most or all pathogen variants (Figure 3A). Moreover, immunizations using 
entire libraries or pools of shuffled antigen chimeras can also result in identification of chimeric antigens that protect 

55 against pathogen variants that were not included in the starting population of antigens (for example, protection against 
strain C by shuffled library of chimeras/mutants of strains A and B in Figure 3B). Accordingly, the antigen library im- 
munization approach enables the development of immunogenic polypeptides that can induce immune responses 
against poorly characterized, newly emerging pathogen variants. 
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[0042] Sequence recombination can be achieved in many different formats and permutations of formats, as described 
in further detail below. These formats share some common principles. For example, the targets for modification vary 
in different applications, as does the property sought to be acquired or improved. Examples of candidate targets for 
acquisition of a property or improvement in a property include genes that encode proteins which have immunogenic 

5 and/or toxigenic activity when introduced into a host organism. 

[0043] The methods use at least two variant forms of a starting target. The variant forms of candidate substrates can 
show substantial sequence or secondary structural similarity with each other, but they should also differ in at least one 
and preferably at least two positions. The initial diversity between forms can be the result of natural variation, e.g., the 
different variant forms (homologs) are obtained from different individuals or strains of an organism, or constitute related 

10 sequences from the same organism (e.g., allelic variations), or constitute homologs from different organisms (inter- 
specific variants). Alternatively, initial diversity can be induced, e.g., the variant forms can be generated by error-prone 
transcription, such as an error-prone PCR or use of a polymerase which lacks proof-reading activity (see, Liao (1990) 
Gene 88:107-1 11 ), of the first variant form, or, by replication of the first form in a mutator strain (mutator host cells are 
discussed in further detail below, and are generally well known). A mutator strain can include any mutants in any 

15 organism impaired in the functions of mismatch repair. These include mutant gene products of mutS, mutT, mutH, mutL, 
ovrD, dcm, vsr, umuC, umuD, sbcB, recJ, etc. The impairment is achieved by genetic mutation, allelic replacement, 
selective inhibition by an added reagent such as a small compound or an expressed antisense RNA, or other tech- 
niques. Impairment can be of the genes noted, or of homologous genes in any organism. Other methods of generating 
initial diversity include methods well known to those of skill in the art, including, for example, treatment of a nucleic 

20 acid with a chemical or other mutagen, through spontaneous mutation, and by inducing an error-prone repair system 
(e.g., SOS) in a cell that contains the nucleic acid. The initial diversity between substrates is greatly augmented in 
subsequent steps of recombination for library generation. 

Properties involved in immunogenicity 

25 

[0044] The effectiveness of an antigen in inducing an immune response against a pathogen can depend upon several 
factors, many of which are not well understood. Most previously available methods for increasing the effectiveness of 
antigens are dependent upon understanding the molecular basis for these factors. However, DNA shuffling and antigen 
library immunization according to the methods of the invention are effective even where the molecular bases are un- 

30 known. The methods of the invention do not rely upon a priori assumptions. 

[0045] Polynucleotide sequences that can positively or negatively affect the immunogenicity of an antigen encoded 
by the polynucleotide are often scattered throughout the entire antigen gene. Several of these factors are shown dia- 
grammatically in Figure 4. By recombining different forms of polynucleotide that encode the antigen using DNA shuffling, 
followed by selection for those chimeric polynucleotides that encode an antigen that can induce an improved immune 

35 response, one can obtain primarily sequences that have a positive influence on antigen immunogenicity. Those se- 
quences that negatively affect antigen immunogenicity are eliminated (Figure 4). One need not know the particular 
sequences involved. 

DNA Shuffling Methods 

40 

[0046] Generally, the methods of the invention entail performing DNA recombination ("shuffling") and screening or 
selection to "evolve" individual genes, whole plasmids or viruses, multigene clusters, or even whole genomes (Stemmer 
(1995) Biol Technology 13:549-553). Reiterative cycles of recombination and screening/selection can be performed to 
further evolve the nucleic acids of interest. Such techniques do not require the extensive analysis and computation 

45 required by conventional methods for polypeptide engineering. Shuffling allows the recombination of large numbers of 
mutations in a minimum number of selection cycles, in contrast to natural pair-wise recombination events (e.g., as 
occur during sexual replication). Thus, the sequence recombination techniques described herein provide particular 
advantages in that they provide recombination between mutations in any or all of these, thereby providing a very fast 
way of exploring the manner in which different combinations of mutations can affect a desired result. In some instances, 

50 however, structural and/or functional information is available which, although not required for sequence recombination, 
provides opportunities for modification of the technique. 

[0047] The DNA shuffling methods of the invention can involve at least one of at least four different approaches to 
improve immunogenic activity as well as to broaden specificity. First, DNA shuffling can be performed on a single gene. 
Secondly, several highly homologous genes can be identified by sequence comparison with known homologous genes. 
55 These genes can be synthesized and shuffled as a family of homologs, to select recombinants with the desired activity. 
The shuffled genes can be cloned into appropriate host cells, such as E. coii, yeast, plants, fungi, animal cells, and 
the like, and those that encode antigens having the desired properties can be identified by the methods described 
below. Third, whole genome shuffling can be performed to shuffle genes that encode antigenic polypeptides (along 



9 



EP 1 555 316 A2 



with other genomic nucleic acids). For whole genome shuffling approaches, it is not even necessary to identify which 
genes are being shuffled. Instead, e.g., bacterial cell or viral genomes are combined and shuffled to acquire recombinant 
polypeptides that have enhanced ability to induce an immune response, as measured in any of the assays described 
below. Fourth, antigenic polypeptide-encoding genes can be codon modified to access mutational diversity not present 
in any naturally occurring gene. Details on each of these procedures can be found below. 

[0048] Exemplary formats and examples for sequence recombination, sometimes referred to as DNA shuffling, ev- 
olution, or molecular breeding, have been described by the present inventors and co-workers in co-pending applications 
U.S. Patent Application Serial No. 08/198,431, filed February 17, 1994, Serial No. PCT/US95/02126, filed, February 
17, 1995, Serial No. 08/425,684, filed April 18, 1995, Serial No. 08/537,874, filed October 30, 1995, Serial No. 
08/564,955, filed November 30,1 995, Serial No. 08/621,859, filed March 25, 1996, Serial No. 08/621,430, filed March 
25, 1996, Serial No. PCT/US96/05480, filed April 18, 1996, Serial No. 08/650,400, filed May 20, 1996, Serial No. 
08/675,502, filed July 3, 1996, Serial No. 08/721, 824, filed September 27, 1996, Serial No. PCT/US97/1 7300, filed 
September 26, 1997, and Serial No. PCT/US97/24239, filed December 17, 1997; Stemmer, Science 270:1 510 (1995); 
Stemmer etal., Gene 1 64:49-53 (1 995); Stemmer, BiolTechnology 13:549-553 (1995); Stemmer, Proc. Natl. Acad. Sci. 
U.S.A. 91 :1 0747-1 0751 (1 994); Stemmer, Nature 370:389-391 (1 994); Crameri etal., Nature Medicine2^ ):1 -3 (1 996); 
Crameri etal., Nature Biotechnology 14:315-319 (1996), each of which is incorporated by reference in its entirety for 
all purposes. 

[0049] Other methods for obtaining recombinant polynucleotides and/or for obtaining diversity in nucleic acids used 
as the substrates for shuffling include, for example, homologous recombination (PCT/US98/05223; Publ. No. 
W098/42727); oligonucleotide-directed mutagenesis (for review see, Smith, Ann. Rev. Genet. 19: 423-462 (1985); 
Botstein and Shortle, Science 229: 1193-1201 (1985); Carter, Biochem. J. 237:1-7 (1986); Kunkel, "The efficiency of 
oligonucleotide directed mutagenesis" in Nucleic acids & Molecular Biology, Eckstein and Lilley, eds., Springer Verlag, 
Berlin (1 987)). Included among these methods are oligonucleotide-directed mutagenesis (Zoller and Smith, Nucl. Acids 
Res. 1 0: 6487-6500 (1 982), Methods in Enzymol. 1 00: 468-500 (1 983), and Methods in Enzymol. 1 54: 329-350 (1 987)) 
phosphothioate-modified DNA mutagenesis (Taylor et al, Nucl. Acids Res. 13: 8749-8764 (1985); Taylor etal, Nucl. 
Acids Res. 13: 8765-8787 (1985); Nakamaye and Eckstein, Nucl. Acids Res. 14: 9679-9698 (1986); Sayers etal., 
Nucl. Acids Res. 16: 791 -802 (1988); Sayers etai, Nucl. Acids Res. 16: 803-814 (1988)), mutagenesis using uracil- 
containing templates (Kunkel, Proc. Nat'l. Acad. Sci. USA 82: 488-492 (1985) and Kunkel etal., Methods in Enzymol. 
1 54: 367-382)); mutagenesis using gapped duplex DNA (Kramer etal., Nucl. Acids Res. 12: 9441 -9456 (1 984); Kramer 
and Fritz, Methods in Enzymol. 154: 350-367 (1987); Kramer etal., Nucl. Acids Res. 16: 7207 (1988)); and Fritz etal., 
Nucl. Acids Res. 1 6: 6987-6999 (1988)). Additional suitable methods include point mismatch repair (Kramer etal., Cell 
38: 879-887 (1984)), mutagenesis using repair-deficient host strains (Carter et al., Nucl. Acids Res. 13: 4431-4443 
(1985); Carter, Methods in Enzymol. 154: 382-403 (1987)), deletion mutagenesis (Eghtedarzadeh and Henikoff, Nucl. 
Acids Res. 14: 5115 (1986)), restriction-selection and restriction-purification (Wells et al., Phil. Trans. R. Soc. Lond. A 
317: 415-423 (1986)), mutagenesis by total gene synthesis (Nambiar etal., Science 223: 1299-1301 (1984); Sakamar 
and Khorana, Nucl. Acids Res. 14: 6361 -6372 (1988); Wells etal., Gene 34: 315-323 (1985); and Grundstrorn etal., 
Nucl. Acids Res. 13: 3305-3316 (1985). Kits for mutagenesis are commercially available (e.g., Bio-Rad, Amersham 
International, Anglian Biotechnology). 

[0050] The breeding procedure starts with at least two substrates that generally show some degree of sequence 
identity to each other (i.e., at least about 30%, 50%, 70%, 80% or 90% sequence identity), but differ from each other 
at certain positions. The difference can be any type of mutation, for example, substitutions, insertions and deletions. 
Often, different segments differ from each other in about 5-20 positions. For recombination to generate increased 
diversity relative to the starting materials, the starting materials must differ from each other in at least two nucleotide 
positions. That is, if there are only two substrates, there should be at least two divergent positions. If there are three 
substrates, for example, one substrate can differ from the second at a single position, and the second can differ from 
the third at a different single position. The starting DNA segments can be natural variants of each other, for example, 
allelic or species variants. The segments can also be from nonallelic genes showing some degree of structural and 
usually functional relatedness (e.g., different genes within a superfamily, such as the family of Yersinia V-antigens, for 
example). The starting DNA segments can also be induced variants of each other. For example, one DNA segment 
can be produced by error-prone PCR replication of the other, the nucleic acid can be treated with a chemical or other 
mutagen, or by substitution of a mutagenic cassette. Induced mutants can also be prepared by propagating one (or 
both) of the segments in a mutagenic strain, or by inducing an error-prone repair system in the cells. In these situations, 
strictly speaking, the second DNA segment is not a single segment but a large family of related segments. The different 
segments forming the starting materials are often the same length or substantially the same length. However, this need 
not be the case; for example; one segment can be a subsequence of another. The segments can be present as part 
of larger molecules, such as vectors, or can be in isolated form. 

[0051] The starting DNA segments are recombined by any of the sequence recombination formats provided herein 
to generate a diverse library of recombinant DNA segments. Such a library can vary widely in size from having fewer 
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than 10 to more than 10 5 , 10 9 , 10 12 or more members. In some embodiments, the starting segments and the recom- 
binant libraries generated will include full-length coding sequences and any essential regulatory sequences, such as 
a promoter and polyadenylation sequence, required for expression. In other embodiments, the recombinant DNA seg- 
ments in the library can be inserted into a common vector providing sequences necessary for expression before per- 
5 forming screening/selection. 

Substrates for Evolution of Optimized Recombinant Antigens 

[0052] The invention provides methods of obtaining recombinant polynucleotides that encode antigens that exhibit 
10 improved ability to induce an immune response to a pathogenic agent. The methods are applicable to a wide range of 
pathogenic agents, including potential biological warfare agents and other organisms and polypeptides that can cause 
disease and toxicity in humans and other animals. The following examples are merely illustrative, and not limiting. 

1 . Bacterial Pathogens and Toxins 

15 

[0053] In some embodiments, the methods of the invention are applied to bacterial pathogens, as well as to toxins 
produced by bacteria and other organisms. One can use the methods to obtain recombinant polypeptides that can 
induce an immune response against the pathogen, as well as recombinant toxins that are less toxic than native toxin 
polypeptides. Often, the polynucleotides of interest encode polypeptides that are present on the surface of the patho- 
20 genie organism. 

[0054] Among the pathogens for which the methods of the invention are useful for producing protective immunogenic 
recombinant polypeptides are the Yersinia species. Yersinia pestis, the causative agent of plague, is one of the most 
virulent bacteria known with LD 50 values in mouse of less than 10 bacteria. The pneumonic form of the disease is 
readily spread between humans by aerosol or infectious droplets and can be lethal within days. A particularly preferred 
25 target for obtaining a recombinant polypeptide that can protect against Yersinia infection is the V antigen, which is a 
37 kDa virulence factor, induces protective immune responses and is currently being evaluated as a subunit vaccine 
(Bru baker (1 991 ) Current Investigations of the Microbiology of Yersinae, 1 2: 1 27). The V-an tig en alone is not toxic, but 

Y pestis isolates that lack the V-antigen are avirulent. The Yersinia V-antigen has been successfully produced in E. 
coli by several groups (Learyetai. (1995) Infect. Immun. 3: 2854). Antibodies that recognize the V-antigen can provide 

30 passive protection against homologous strains, but not against heterologous strains. Similarly, immunization with pu- 
rified V antigen protects against only homologous strains. To obtain cross-protective recombinant V antigen, in a pre- 
ferred embodiment, V antigen genes from various Yersinia species are subjected to family shuffling. The genes en- 
coding the V antigen from Y pestis, Y enter ocolitica, and Y pseudotuberculosis, for example, are 92-99% identical at 
the DNA level, making them ideal for optimization using family shuffling according to the methods of the invention. 

35 After shuffling, the library of recombinant nucleic acids is screened and/or selected for those that encode recombinant 

V antigen polypeptides that can induce an improved immune response and/or have greater cross-protectivity. 
[0055] Bacillus anthracis, the causative agent of anthrax, is another example of a bacterial target against which the 
methods of the invention are useful. The anthrax protective antigen (PA) provides protective immune responses in test 
animals, and antibodies against PA also provide some protection. However, the immunogenicity of PA is relatively poor, 

40 so multiple injections are typically required when wild-type PA is used. Co-vaccination with lethal factor (LF) can improve 
the efficacy of wild-type PA vaccines, but toxicity is a limiting factor. Accordingly the DNA shuffling and antigen library 
immunization methods of the invention can be used to obtain nontoxic LF. Polynucleotides that encode LF from various 
B. anthracis strains are subjected to family shuffling. The resulting library of recombinant LF nucleic acids can then be 
screened to identify those that encode recombinant LF polypeptides that exhibit reduced toxicity. For example, one 

45 can inoculate tissue culture cells with the recombinant LF polypeptides in the presence of PA and select those clones 
for which the cells survive. If desired, one can then backcross the nontoxic LF polypeptides to retain the immunogenic 
epitopes of LF. Those that are selected through the first screen can then be subjected to a secondary screen. For 
example, one can test for the ability of the recombinant nontoxic LF polypeptides to induce an immune response (a 
g., CTL or antibody response) in a test animal such as mice. In preferred embodiments, the recombinant nontoxic LF 

50 polypeptides are then tested for ability to induce protective immunity in test animals against challenge by different 
strains of B. anthracis. 

[0056] The protective antigen (PA) of B. anthracis is also a suitable target for the methods of the invention. PA- 
encoding nucleic acids from various strains of B. anthracis are subjected to DNA shuffling. One can then screen for 
proper folding in, for example, E. coli, using polyclonal antibodies. Screening for ability to induce broad-spectrum 
55 antibodies in a test animal is also typically used, either alone or in addition to a preliminary screening method. In 
presently preferred embodiments, those recombinant polynucleotides that exhibit the desired properties can be back- 
crossed so that the immunogenic epitopes are maintained. Finally, the selected recombinants are tested for ability to 
induce protective immunity against different strains of B. anthracis in a test animal. 



11 



EP 1 555 316 A2 



[0057] The Staphylococcus aureus and Streptococcus toxins are another example of a target polypeptide that can 
be altered using the methods of the invention. Strains of Staphylococcus aureus and group A Streptococci are involved 
in a range of diseases, including food poisoning, toxic shock syndrome, scarlet fever and various autoimmune disorders. 
They secrete a variety of toxins, which include at least five cytolytic toxins, a coagulase, and a variety of enterotoxins. 
The enterotoxins are classified as superantigens in that they crosslink MHC class II molecules with T cell receptors to 
cause aconstitutiveTcell activation (Fields etal, (1996) Nature3S4\ 1 88). This results in the accumulation of pathogenic 
levels of cytokines that can lead to multiple organ failure and death. At least thirty related, yet distinct enterotoxins 
have been sequenced and can be phylogenetically grouped into families. Crystal structures have been obtained for 
several members alone and in complex with MHC class II molecules. Certain mutations in the MHC class ll-binding 
site of the toxins strongly reduce their toxicity and can form the basis of attenuated vaccines (Woody et al. (1997) 
Vaccine 15: 133). However, a successful immune response to one type of toxin may provide protection against closely 
related family members, whereas little protection against toxins from the other families is observed. Family shuffling 
of enterotoxin genes from various family members can be used to obtain recombinant toxin molecules that have reduced 
toxicity andean induce across-protective immune response. Shuffled antigens can also be screened to identify antigens 
that elicit neutralizing antibodies in an appropriate animal model such as mouse or monkey. Examples of such assays 
can include ELISA formats in which the elicited antibodies prevent binding of the enterotoxin to the MHC complex and/ 
or T cell receptors on cells or purified forms. These assays can also include formats where the added antibodies would 
prevent T cells from being cross-linked to appropriate antigen presenting cells. 

[0058] Cholera is an ancient, potentially lethal disease caused by the bacterium Vibrio cholerae and an effective 
vaccine for its prevention is still unavailable. Much of the pathogenesis of this disease is caused by the cholera enter- 
otoxin. Ingestion of microgram quantities of cholera toxin can induce severe diarrhea causing loss of tens of liters of 
fluid. Cholera toxin is a complex of a single catalytic A subunit with a pentameric ring of identical B subunits. Each 
subunit is inactive on its own. The B subunits bind to specific ganglioside receptors on the surface of intestinal epithelial 
cells and trigger the entry of the A subunit into the cell. The A subunit ADP-ribosylates a regulatory G protein initiating 
a cascade of events causing a massive, sustained flow of electrolytes and water into the intestinal lumen resulting in 
extreme diarrhea. 

[0059] The B subunit of cholera toxin is an attractive vaccine target for a number of reasons. It is a major target of 
protective antibodies generated during cholera infection and contains the epitopes for antitoxin neutralizing antibodies. 
It is nontoxic without the A subunit, is orally effective, and stimulates production of a strong IgA-dominated gut mucosal 
immune response, which is essential in protection against cholera and cholera toxin. The B subunit is also being 
investigated for use as an adjuvant in other vaccine preparations, and therefore, evolved toxins may provide general 
improvements for a variety of different vaccines. The heat-labile enterotoxins (LT) from enterotoxigenic E. coll strains 
are structurally related to cholera toxin and are 75% identical at the DNA sequence level. To obtain optimized recom- 
binant toxin molecules that exhibit reduced toxicity and increased ability to induce an immune response that is protective 
against V. cholerae and E. coil, the genes that encode the related toxins are subjected to DNA shuffling. 
[0060] The recombinant toxins are then tested for one or more of a several desirable traits. For example, one can 
screen for improved cross-reactivity of antibodies raised against the recombinant toxin polypeptides, for lack of toxicity 
in a cell culture assay, and for ability to induce a protective immune response against the pathogens and/or against 
the toxins themselves. The shuffled clones can be selected by phage display and/or screened by phage ELISA and 
ELISA assays for the presence of epitopes from the different serotypes. Variant proteins with multiple epitopes can 
then be purified and used to immunize mice or other test animal. The animal serum is then assayed for antibodies to 
the different B chain subtypes and variants that elicit a broad cross-reactive response will be evaluated further in a 
virulent challenge model. The E. coli and V cholerae toxins can also act as adjuvants that are capable of enhancing 
mucosal immunity and oral delivery of vaccines and proteins. Accordingly, one can test the library of recombinant toxins 
for enhancement of the adjuvant activity. 

[0061] Shuffled antigens can also be screened for improved expression levels and stability of the B chain pentamer, 
which may be less stable than when in the presence of the A chain in the hexameric complex. Addition of a heat 
treatment step or denaturing agents such as salts, urea, and/or guanidine hydrochloride can be included prior to ELISA 
assays to measure yields of correctly folded molecules by appropriate antibodies. It is sometimes desirable to screen 
for stable monomeric B chain molecules, in an ELISA format, for example, using antibodies that bind monomeric, but 
not pentameric B chains. Additionally, the ability of shuffled antigens to elicit neutralizing antibodies in an appropriate 
animal model such as mouse or monkey can be screened. For example, antibodies that bind to the B chain and prevent 
its binding to its specific ganglioside receptors on the surface of intestinal epithelial cells may prevent disease. Similarly 
antibodies that bind to the B chain and prevent its pentamerization or block A chain binding may be useful in preventing 
disease. 

[0062] The bacterial antigens that can be improved by DNA shuffling for use as vaccines also include, but are not 
limited to, Helicobacter pylori antigens CagA and VacA (Blaser (1996) Aliment, Pharmacol, Ther. 1 : 73-7; Blaser and 
Crabtree (1996) Am, J, Clin, Pathol, 106: 565-7; Censini etal, (1996) Proc, Nat'l, Acad, Sci, USA 93: 14648-14643). 
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Other suitable H. pylori antigens include, for example, four immunoreactive proteins of 45-65 kDaas reported by Chatha 
etal. (1997) Indian J. Med. Res. 105: 170-175 and the H. pylori GroES homologue (HspA) (Kansau etai. (1996) Mol. 
Microbiol. 22:1 013-1 023. Other suitable bacterial antigens include, but are not limited to, the 43-kDa and the fimbrilin 
(41 kDa) proteins of P. gingivalis (Boutsl etai. (1996) Oral Microbiol, Immunol. 11: 236-241); pneumococcal surface 
protein A (Briles etal. (1996) Ann. NY Acad. Sci. 797: 118-126); Chlamydia psittaci antigens, 80-90 kDa protein and 
110 kDa protein (Buendia etal. (1997) FEMS Microbiol, Lett. 150: 113-9); the chlamydial exoglycolipid antigen (GLXA) 
(Whittum-Hudson et ai. (1996) Nature Med. 2: 1116-1121 ); Chlamydia pneumoniae species-specific antigens in the 
molecular weight ranges 92-98, 51-55,43-46 and 31.5-33 kDa and genus-specific antigens in the ranges 12, 26 and 
65-70 kDa (Hal me etal. (1 997) Scand. J. Immunol. 45: 378-84); Neisseria gonorrhoeae (GC) or Escherichia coli phase- 
variable opacity (Opa) proteins (Chen and Gotschlich (1996) Proc. Natl. Acad. Sci. USA 93: 14851-14856), any of the 
twelve immunodominant proteins of Schistosoma mansoni (ranging in molecular weight from 14 to 208 kDa) as de- 
scribed by Cutts and Wilson (1 997) Parasitology 114: 245-55; the 17-kDa protein antigen of Brucella abortus (De Mot 
et ai. (1 996) Curr. Microbiol. 33: 26-30); a gene homolog of the 1 7-kDa protein antigen of the Gram-negative pathogen 
Brucella abortus identified in the nocardioform actinomycete Rhodococcus sp. N 1 86/21 (De Mot et ai. (1996) Curr. 
Microbiol. 33: 26-30); the staphylococcal enterotoxins (SEs) (Wood etal. (1997) FEMS Immunol. Med. Microbiol. 17: 
1-10), a 42-kDa M. hyopneumoniae NrdF ribonucleotide reductase R2 protein or 15-kDa subunit protein of M. hyop- 
neumoniae (Fag an etal. (1997) Infect. Immun. 65: 2502-2507), the meningococcal antigen PorA protein (Feavers et 
ai. (1997) Clin. Diagn. Lab. Immunol. 3: 444-50); pneumococcal surface protein A (PspA) (McDaniel etai. (1997) Gene 
Then 4: 375-377); F. tularensis outer membrane protein FopA (Fulop etal. (1996) FEMS Immunol. Med. Microbiol. 13: 
245-247); the major outer membrane protein within strains of the genus Actinobacillus (Hartmann etal. (1 996) Zentralbl. 
Bakteriol. 284: 255-262); p60 or listeriolysin (Hly) antigen of Listeria monocytogenes (Hess et al. (1996) Proc. Natl 
Acad. Sci. USA 93: 1458-1463); flagellar (G) antigens observed on Salmonella enteritidis and S. pullorum (Holt and 
Chaubal (1 997) J. Clin. Microbiol. 35:1 01 6-1 020); Bacillus anthracis protective antigen (PA) (Ivins etal. (1 995) Vaccine 
13: 1779-1784); Echinococcus granulosus antigen 5 (Jones et al. (1996) Parasitology 11 3: 213-222); the ro/genes of 
Shigella dysenteriae 1 and Escherichia coli K-1 2 (Klee etal. (1 997) J. Bacteriol. 1 79: 2421 -2425); cell surface proteins 
Rib and alpha of group B streptococcus (Larsson et al. (1996) Infect. Immun. 64: 3518-3523); the 37 kDa secreted 
polypeptide encoded on the 70 kb virulence plasmid of pathogenic Yersinia spp. (Leary etal. (1 995) Contrib. Microbiol. 
Immunol. 13: 216-217 and Roggenkamp et al. (1997) Infect. Immun. 65:446-51); the OspA (outer surface protein A) 
of the Lyme disease spirochete Borrelia burgdorferi (Li et al. (1 997) Proc. Nat'l. Acad. Sci. USA 94: 3584-3589, Padilla 
etal. (1996) J. Infect. Dis. 174: 739-746, and Wallich etal. (1996) Infection 24: 396-397); the Brucella melitensis group 
3 antigen gene encoding Omp28 (Lindler etal. (1 996) Infect. Immun. 64: 2490-2499); the PAc antigen of Streptococcus 
mutans (Murakami etal. (1997) Infect. Immun. 65: 794-797); pneumolysin, Pneumococcal neuraminidases, autolysin, 
hyaluronidase, and the 37 kDa pneumococcal surface adhesin A (Paton et al. (1997) Microb. Drug Resist. 3: 1-10); 
29-32, 41-45, 63-71 x 10(3) MW antigens of Salmonella typhi (Perez etai. (1996) Immunology 89: 262-267); K-antigen 
as a marker of Klebsiella pneumoniae (Priamukhina and Morozova (1996) Klin. Lab. Diagn. 47-9); nocardial antigens 
of molecular mass approximately 60, 40, 20 and 15-10 kDa (Prokesova et al. (1996) Int. J. Immunopharmacol. 18: 
661-668); Staphylococcus aureus antigen ORF-2 (Rieneck etal. (1997) Biochim Biophys Acta 1350: 128-132); GIpQ 
antigen of Borrelia hermsii (Schwan et al. (1996) J. Clin. Microbiol. 34: 2483-2492); cholera protective antigen (CPA) 
(Sciortino (I996) J. Diarrhoea! Dis. Res. 14: 16-26); a 190-kDa protein antigen of Streptococcus mutans (Senpuku et 
al. (1996) Oral Microbiol. Immunol. 11: 121-128); Anthrax toxin protective antigen (PA) (Sharma et al. (1996) Protein 
Expr. Purif. 7: 33-38); Clostridium perfringens antigens and toxoid (Strom etal. (1 995) Br. J. Rheumatol. 34: 1 095-1 096); 
the SEF14 fimbrial antigen of Salmonella enteritidis (Thorns et al. (1996) Microb. Pathog. 20: 235-246); the Yersinia 
pestis capsular antigen (F1 antigen) (Titball et al. (1997) Infect. Immun. 65: 1926-1930); a 35-kilodalton protein of 
Mycobacterium leprae (Triccas et al. (1996) Infect. Immun. 64: 5171-5177); the major outer membrane protein, CD, 
extracted from Moraxella (Branhamella) catarrhalis (Yang et al. (1997) FEMS Immunol. Med. Microbiol. 17: 187-199); 
pH6 antigen (PsaA protein) of Yersinia pestis (Zav'yalov et al. (1996) FEMS Immunol. Med. Microbiol. 14: 53-57); a 
major surface glycoprotein, gp63, of Leishmania major (Xu and Liew (1994) Vaccine 12: 1534-1536; Xu and Liew 

(1995) Immunology 84: 173-176); mycobacterial heat shock protein 65, mycobacterial antigen (Mycobacterium leprae 
hsp65) (Lowrie etal. (1994) Vaccine 12: 1537-1540; Ragno etal. (1997) Arthritis Rheum. 40: 277-283; Silva (1995) 
Braz. J. Med. Biol. Res. 28: 843-851); Mycobacterium tuberculosis antigen 85 (Ag85) (Huygen etal. (1996) Nat. Med. 
2: 893-898); the 45/47 kDa antigen complex (APA) of Mycobacterium tuberculosis, M. bovis and BCG (Horn et al. 

(1996) J. Immunol. Methods 197: 151-159); the mycobacterial antigen, 65-kDa heat shock protein, hsp65 (Tascon et 
al. (1996) Nat. Med. 2: 888-892); the mycobacterial antigens MPB64, MPB70, MPB57 and alpha antigen (Yamada et 
al. (1995) KekkakulO: 639-644); the M. tuberculosis 38 kDa protein (Vordermeier etal. (1995) Vaccine 13: 1576-1582); 
the MPT63, MPT64 and MPT-59 antigens from Mycobacterium tuberculosis (Manca et al. (1997) Infect. Immun. 65: 
16-23; Oettinger etal. (1997) Scand. J. Immunol. 45: 499-503; Wilcke etal. (1996) Tuber. Lung Dis. 77: 250-256); the 
35-kilodalton protein of Mycobacterium leprae (Triccas etal. (1 996) Infect. Immun. 64: 51 71 -51 77); the ESAT-6 antigen 
of virulent mycobacteria (Brandt etal. (1996) J. Immunol. 157: 3527-3533; Pollock and Andersen (1997) J. Infect. Dis. 
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175: 1251-1254); Mycobacterium tuberculosis 16-kDa antigen (Hsp16.3) (Chang et al. (1996) J, Biol. Chem. 271: 
7218-7223); and the 1 8-kilodalton protein of Mycobacterium leprae (Baumgart et al, (1996) Infect, Immun, 64: 
2274-2281). 

5 2. Viral Pathogens 

[0063] The methods of the invention are also useful for obtaining recombinant nucleic acids and polypeptides that 
have enhanced ability to induce an immune response against viral pathogens. While the bacterial recombinants de- 
scribed above are typically administered in polypeptide form, recombinants that confer viral protection are preferably 

10 administered in nucleic acid form, as genetic vaccines. 

[0064] One illustrative example is the Hantaan virus. Glycoproteins of this virus typically accumulate at the mem- 
branes of the Golgi apparatus of infected cells. This poor expression of the glycoprotein prevents the development of 
efficient genetic vaccines against these viruses. The methods of the invention solve this problem by performing DNA 
shuffling on nucleic acids that encode the glycoproteins and identifying those recombinants that exhibit enhanced 

15 expression in a host cell, and/or for improved immunogenicity when administered as a genetic vaccine. A convenient 
screening method for these methods is to express the recombinant polynucleotides as fusion proteins to PIG, which 
results in display of the polypeptides on the surface of the host cell (Whitehorn et al. (1995) Biotechnology (N Y) 13: 
1215-9). Fluorescence-activated cell sorting is then used to sort and recover those cells that express an increased 
amount of the antigenic polypeptide on the cell surface. This preliminary screen can be followed by immunogenicity 

20 tests in mammals, such as mice. Finally, in preferred embodiments, those recombinant nucleic acids are tested as 
genetic vaccines for their ability to protect a test animal against challenge by the virus. 

[0065] The flaviviruses are another example of a viral pathogen for which the methods of the invention are useful 
for obtaining a recombinant polypeptide or genetic vaccine that is effective against a viral pathogen. The flaviviruses 
consist of three clusters of antigenically related viruses: Dengue 1 -4 (62-77% identity), Japanese, St. Louis and Murray 

25 Valley encephalitis viruses (75-82% identity), and the tick-borne encephalitis viruses (77-96% identity). Dengue virus 
can induce protective antibodies against SLE and Yellow fever (40-50% identity), but few efficient vaccines are avail- 
able. To obtain genetic vaccines and recombinant polypeptides that exhibit enhanced cross-reactivity and immuno- 
genicity, the polynucleotides that encode envelope proteins of related viruses are subjected to DNA shuffling. The 
resulting recombinant polynucleotides can be tested, either as genetic vaccines or by using the expressed polypeptides, 

30 for ability to induce a broadly reacting neutralizing antibody response. Finally, those clones that are favorable in the 
preliminary screens can be tested for ability to protect a test animal against viral challenge. 

[0066] Viral antigens that can be evolved by DNA shuffling for improved activity as vaccines include, but are not 
limited to, influenza A virus N2 neuraminidase (Kilbourne etai. (1 995) Vaccine 13: 1 799-1 803); Dengue virus envelope 
(E) and premembrane (prM) antigens (Feighny etai. (1994) Am. J. Trop. Med. Hyg. 50: 322-328; Putnak et al. (1996) 

35 Am. J. Trop. Med. Hyg. 55: 504-10); HIV antigens Gag, Pol, Vif and Nef (Vogt etai. (1995) Vaccine 13: 202-208); HIV 
antigens gp120 and gp160 (Achour etai. (1995) Cell. Mol. Biol. 41 : 395-400; Hone etai. (1994) Dev. Biol. Stand. 82: 
159-1 62); gp41 epitope of human immunodeficiency virus (Eckhart etai. (1996) J. Gen. Virol. 77:2001-2008); rotavirus 
antigen VP4 (Mattion etai. (1995) J. Virol. 69: 5132-5137); the rotavirus protein VP7 or VP7sc (Emslie etai. (1995) J. 
Virol. 69: 1747-1754; Xu etai. (1995) J. Gen. Virol. 76/1971-1980); herpes simplex virus (HSV) glycoproteins gB, gC, 

40 gD, gE, gG, gH, and gl (Fleck et al. (1994) Med. Microbiol. Immunol. (Berl) 183: 87-94 [Mattion, 1995]; Ghiasi et al. 
(1995) Invest. Ophthalmol. Vis. Sci. 36: 1352-1360; McLean etai. (1994) J. Infect. Dis. 170: 1100-1109); immediate- 
early protein ICP47 of herpes simplex virus-type 1 (HSV-1) (Banks et al. (1994) Virology 200: 236-245); immediate- 
early (IE) proteins ICP27, ICP0, and ICP4 of herpes simplex virus (Manickan et al. (1995) J. Virol. 69: 4711-4716); 
influenza virus nucleoprotein and hemagglutinin (Deck et al. (1997) Vaccine 15: 71-78; Fu et al. (1997) J. Virol. 71 : 

45 2715-2721 ); B 19 parvovirus capsid proteins VP1 (Kawase et al. (1 995) Virology 211 : 359-366) or VP2 (Brown et al. 
(1994) Virology 198: 477-488); Hepatitis B virus core and e antigen (Schodel etai. (1996) Interviroiogy 39: 104-106); 
hepatitis B surface antigen (Shiau and Murray (1997) J. Med. Virol. 51 : 159-166); hepatitis B surface antigen fused to 
the core antigen of the virus (Id.); Hepatitis B virus core-preS2 particles (Nemeckova et al. (1996) Acta Virol. 40: 
273-279); HBV preS2-S protein (Kutinova et al. (1996) Vaccine 14: 1045-1052); VZV glycoprotein I (Kutinova et al. 

50 (1996) Vaccine J \4\ 1045-1052); rabies virus glycoproteins (Xiang etai. (1994) Virology 1 99: 132-140; Xuan etai. (1995) 
Virus Res. 36: 151 -161) or ribonucleocapsid (Hooper et al. (1994) Proc. Natl. Acad. Sci. USASA :1 0908-1 0912); human 
cytomegalovirus (HCMV) glycoprotein B (UL55) (Britt etai. (1 995) J. Infect. Dis. 171:1 8-25); the hepatitis C virus (HCV) 
nucleocapsid protein in a secreted or a nonsecreted form, or as a fusion protein with the middle (pre-S2 and S) or 
major (S) surface antigens of hepatitis B virus (HBV) (Inchauspe etai. (1997) DNA Cell Biol. 16: 185-195; Major etai. 

55 (1995) J. Virol. 69: 5798-5805); the hepatitis C virus antigens: the core protein (pC); E1 (pE1) and E2 (pE2) alone or 
as fusion proteins (Saito etai. (1997) Gastroenterology 112: 1321-1330); the gene encoding respiratory syncytial virus 
fusion protein (PFP-2) (Falsey and Walsh (1996) Vaccine 14:1214-121 8; Piedraef a/. (1996) Pediatr. infect. Dis. J. 15: 
23-31); the VP6 and VP7 genes of rotaviruses (Choi etai. (1997) Virology 232: 129-138; Jin etai. (1996) Arch. Virol. 
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141 : 2057-2076); the E1 , E2, E3, E4, E5, E6 and E7 proteins of human papillomavirus (Brown et al. (1994) Virology 
201 : 46-54; Dillner etal. (1995) Cancer Detect. Prev. 19: 381-393; Krul etal. (1996) Cancer Immunol. Immunother. 43: 
44-48; Nakagawa et al. (1997) J. Infect. Dis, 175: 927-931); a human T-lymphotropic virus type I gag protein (Porter 
et al. (1995) J. Med. Virol. 45: 469-474); Epstein-Barr virus (EBV) gp340 (Mackett ef al. (1996) J. Med. Virol. 50: 
5 263-271); the Epstein-Barr virus (EBV) latent membrane protein LMP2 (Lee et al. (1996) Eur. J. Immunol. 26: 
1 875-1 883); Epstein-Barr virus nuclear antigens 1 and 2 (Chen and Cooper (1 996) J. Virol. 70: 4849-4853; Khanna et 
al. (1995) Virology 214: 633-637); the measles virus nucleoprotein (N) (Fooks et al. (1995) Virology 210: 456-465); 
and cytomegalovirus glycoprotein gB (Marshall etal. (1994) J. Med. Virol. 43: 77-83) or glycoprotein gH (Rasmussen 
etal. (1994) J. Infect. Dis. 170: 673-677). 

10 

3. Parasites 

[0067] Antigens from parasites can also be optimized by the methods of the invention. These include, but are not 
limited to, the schistosome gut-associated antigens CAA (circulating anodic antigen) and CCA (circulating cathodic 

15 antigen) in Schistosoma mansoni, S. haematobium or S. japonicum (Deelder ef al. (1996) Parasitology 112: 21-3 5); 
a multiple antigen peptide (MAP) composed of two distinct protective antigens derived from the parasite Schistosoma 
mansoni (Ferru et al. (1997) Parasite Immunol. 19: 1-11); Leishmania parasite surface molecules (Lezama-Davila 
(1997) Arch. Med. Res. 28: 47-53); third-stage larval (L3) antigens of L. loa (Akue et al. (1997) Infecr. Dis. 175: 
1 58-63); the genes. Tarns 1 -1 and Tarns 1 -2, encoding the 30-and 32-kDa major merozoite surface antigens of Theileria 

20 annulata (Ta) (d'Oliveira et al. (1 996) Gene 1 72: 33-39); Plasmodium falciparum merozoite surface antigen 1 or 2 (al- 
Yaman etal. (1 995) Trans. R. Soc. Trop. Med. Hyg. 89: 555-559; Beck etal. (1 997) J. Infect. Dis. 1 75: 921 -926; Rzepczyk 
et al. (1997) Infect. Immun. 65: 1098-1100)-. circumsporozoite (CS) protein-based B-epitopes from Plasmodium 
berghei, (PPPPNPND) 2 (SEQ ID NO:23) and Plasmodium yoelii, (QGPGAP) 3 QG (SEQ ID NO:24), along with a P. 
berghei T-helper epitope KQIRDSITEEWS (SEQ ID NO:25)(Reed et al. (1997) Vaccine 15: 482-488); NYVAC-Pf7 

25 encoded Plasmodium falciparum antigens derived from the sporozoite (circumsporozoite protein and sporozoite sur- 
face protein 2), liver (liver stage antigen 1), blood (merozoite surface protein 1, serine repeat antigen, and apical 
membrane antigen 1 ), and sexual (25-kDa sexual-stage antigen) stages of the parasite life cycle were inserted into a 
single NY VAC genome to generate NYVAC-Pf7 (Tine et al. (1996) Infect. Immun. 64: 3833-3844); Plasmodium falci- 
parum antigen Pfs230 (Williamson etal. (1996) Mol. Biochem. Parasitol. 78: 161-169); Plasmodium falciparum apical 

30 membrane antigen (AMA-1) (Lai et al. (1996) Infect. Immun. 64: 1054-1059); Plasmodium falciparum proteins Pfs28 
and Pfs25 (Duffy and Kaslow (1 997) Infect. Immun. 65: 1 1 09-1 1 1 3); Plasmodium falciparum merozoite surface protein, 
MSP1 (Hui etal. (1996) Infect. Immun. 64: 1502-1509); the malaria antigen Pf332 (Ahlborg etal. (1996) Immunology 
88: 630-635); Plasmodium falciparum erythrocyte membrane protein 1 (Baruch et al. (1995) Proc. Nat'l. Acad. Sci. 
USA 93: 3497-3502; Baruch etal. (1995) Cell 82: 77-87); Plasmodium falciparum merozoite surface antigen, PfMSP- 

35 1 (Egan etal. (1996) J. Infect. Dis. 173: 765-769); Plasmodium falciparum antigens SERA, EBA-175, RAP1 and RAP2 
(Riley (1 997) J. Pharm. Pharmacol. 49: 21 -27); Schistosoma japonicum paramyosin (Sj97) or fragments thereof (Yang 
et al. (1995) Biochem. Biophys. Res. Commun. 212: 1029-1039); and Hsp70 in parasites (Maresca and Kobayashi 
(1994) Experientia 50: 1067-1074). 

40 4. Allerg y 

[0068] The invention also provides methods of obtaining reagents that are useful for treating allergy. In one embod- 
iment, the methods involve making a library of recombinant polynucleotides that encode an allergen, and screening 
the library to identify those recombinant polynucleotides that exhibit improved properties when used as immunothera- 

45 peutic reagents for treating allergy. For example, specific immunotherapy of allergy using natural antigens carries a 
risk of inducing anaphylaxis, which can be initiated by cross-linking of high-affinity IgE receptors on mast cells. There- 
fore, allergens that are not recognized by pre-existing IgE are desirable. The methods of the invention provide methods 
by which one can obtain such allergen variants. Another improved property of interest is induction of broader immune 
responses, increased safety and efficacy. 

so [0069] Synthesis of polyclonal and allergen-specific IgE requires multiple interactions between B cells, T cells and 
professional antigen-presenting cells (APC). Activation of naive, unprimed B cells is initiated when specific B cells 
recognize the allergen by cell surface immunoglobulin (slg). However, costimulatory molecules expressed by activated 
T cells in both soluble and membrane-bound forms are necessary for differentiation of B cells into IgE-secreting plasma 
cells. Activation ofT helper cells requires recognition of an antigenic peptide in the context of MHC class II molecules 

55 on the plasma membrane of APC, such as monocytes, dendritic cells, Langerhans cells or primed B cells. Professional 
APC can efficiently capture the antigen and the peptide-MHC class II complexes are formed in apost-Golgi, proteolytic 
intracellular compartment and subsequently exported to the plasma membrane, where they are recognized by T cell 
receptor (TCR) (Whitton (1998) Curr. Top. Microbiol. Immunol. 232: 1-13). In addition, activated B cells express CD80 
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(B7-1 ) and CD86 (B7-2, B70), which are the counter receptors for CD28 and which provide a costimulatory signal for 
T cell activation resulting in T cell proliferation and cytokine synthesis. Since allergen-specific T cells from atopic indi- 
viduals generally belong to the T H 2 cell subset, activation of these cells also leads to production of IL-4 and IL-13, 
which, together with membrane-bound costimulatory molecules expressed by activated T helper cells, direct B cell 
differentiation into IgE-secreting plasma cells. 

[0070] Mast cells and eosinophils are key cells in inducing allergic symptoms in target organs. Recognition of specific 
antigen by IgE bound to high-affinity IgE receptors on mast cells, basophils or eosinophils results in crosslinking of the 
receptors leading to degranulation of the cells and rapid release of mediator molecules, such as histamine, prostag- 
landins and leukotrienes, causing allergic symptoms. 

[0071] Immunotherapy of allergic diseases currently includes hyposensibilization treatments using increasing doses 
of allergen injected to the patient. These treatments result skewing of immune responses towards T H 1 phenotype and 
increase the ratio of IgG/lgE antibodies specific for allergens. Because these patients have circulating IgE antibodies 
specific for the allergens, these treatments include significant risk of anaphylactic reactions. In these reactions, free 
circulating allergen is recognized by IgE molecules bound to high-affinity IgE receptors on mast cells and eosinophils. 
Recognition of the allergen results in crosslinking of the receptors leading to release of mediators, such as histamine, 
prostaglandins, and leukotrienes, which cause the allergic symptoms, and occasionally anaphylactic reactions. Other 
problems associated with hyposensibilization include low efficacy and difficulties in producing allergen extracts repro- 
ducibly. 

[0072] The methods of the invention provide a means to obtain allergens that, when used in genetic vaccines, provide 
a means of circumventing the problems that have limited the usefulness of previously known hyposensibilization treat- 
ments. For example, by expressing antigens on the surface of cells, such as muscle cells, the risk of anaphylactic 
reactions is significantly reduced. This can be conveniently achieved by using genetic vaccine vectors that encode 
transmembrane forms of allergens. The allergens can also be modified in such a way that they are efficiently expressed 
in transmembrane forms, further reducing the risk of anaphylactic reactions. Another advantage provided by the use 
of genetic vaccines for hyposensibilization is that the genetic vaccines can include cytokines and accessory molecules 
which further direct the immune responses towards the T H 1 phenotype, thus reducing the amount of IgE antibodies 
produced and increasing the efficacy of the treatments. To further reduce IgE production, one can administer the shuf- 
fled allergens using vectors that have been evolved to induce primarily IgG and IgM responses, with little or no IgE 
response (see, e.g., US Patent Application Ser. No. 09/021,769, filed February 11, 1998). 

[0073] In these methods, polynucleotides encoding known allergens, or homologs or fragments thereof (e.g., immu- 
nogenic peptides) are inserted into DNA vaccine vectors and used to immunize allergic and asthmatic individuals. 
Alternatively, the shuffled allergens are expressed in manufacturing cells, such as E. coli or yeast cells, and subse- 
quently purified and used to treat the patients or prevent allergic disease. DNA shuffling or other recombination method 
can be used to obtain allergens that activate T cells but cannot induce anaphylactic reactions. For example, a library 
of recombinant polynucleotides that encode allergen variants can be expressed in cells, such as antigen presenting 
cells, which are than contacted with PBMC or T cell clones from atopic patients. Those library members that efficiently 
activate T H cells from the atopic patients can be identified by assaying for T cell proliferation, or by cytokine synthesis 
(e.g., synthesis of IL-2, IL-4, IFN-y. Those recombinant allergen variants that are positive in the in vitro tests can then 
be subjected to in vivo testing. 

[0074] Examples of allergies that can be treated include, but are not limited to, allergies against house dust mite, 
grass pollen, birch pollen, ragweed pollen, hazel pollen, cockroach, rice, olive tree pollen, fungi, mustard, bee venom. 
Antigens of interest include those of animals, including the mite (e.g., Dermatophagoides pteronyssinus, Dermatopha- 
goides farinae, Blomia tropicalis), such as the allergens derpl (Scobie et af. (1994) Biochem. Soc. Trans. 22: 448S; 
Yssel etal. (1992) J. Immunol. 148: 738-745), der p2 (Chua etal. (1996) Clin. Exp. Allergy 26: 829-837), der p3 (Smith 
and Thomas (1996) Clin. Exp. Allergy 26: 571-579), der p5, der p V (Lin et al. (1994) J. Allergy Clin. Immunol. 94: 
989-996), der p6 (Bennett and Thomas (1996) Clin. Exp. Allergy 26: 1150-1154), der p 7 (Shen etal. (1995) Clin. Exp. 
Allergy 25: 416-422), derf2 (Yuuki etal. (1997) Int. Arch. Allergy Immunol. 112: 44-48), derf3 (Nishiyama et al. (1995) 
FEBS Lett. 377: 62-66), der f7 (Shen etal. (1995) Clin. Exp. Allergy 25: 1000-1006); Mag 3 (Fujikawa etal. (1996) 
Mol. Immunol. 33: 31 1 -31 9). Also of interest as antigens are the house dust mite allergens Tyr p2 (Eriksson et al. (1 998) 
Eur. J. Biochem. 251 : 443-447), Lep d1 (Schmidt etal. (1995) FEBS Lett. 370:11-14), and glutathione S-transferase 
(O'Neill et al. (1 995) Immunol Lett. 48: 1 03-1 07); the 25,589 Da, 21 9 amino acid polypeptide with homology with glu- 
tathione S-transferases (O'Neill etal. (1994) Biochim. Biophys. Acta. 1219: 521-528); Bio t 5 (Arruda etal. (1995) Int. 
Arch. Allergy Immunol. 1 07: 456-457); bee venom phospholipase A2 (Carballido etal. (1 994) J. Allergy Clin. Immunol. 
93: 758-767; Jutel etal. (1995) J. Immunol. 154: 4187-4194); bovine dermal/dander antigens BDA 11 (Rautiainen et 
al. (1995) J. Invest. Dermatol. 105: 660-663) and BDA20 (Mantyjarvi et al. (1996) J. Allergy Clin. Immunol. 97: 
1297-1303); the major horse allergen Equ d (Gregoire et al. (1996) J. Biol. Chem. 271: 32951-32959); Jumper ant 
M. pilosula allergen Myr p I and its homologous allergenic polypeptides Myr p2 (Donovan et al. (1996) Biochem. Mol. 
Biol. Int. 39: 877-885); 1-13, 14, 16 kD allergens of the mite Blomia tropicalis (Caraballo et al. (1996) J. Allergy Clin. 
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Immunol. 98: 573-579); the cockroach allergens Bla g Bd90K (Helm etai. (1 996) J. Allergy Clin. Immunol. 98: 1 72-80) 
and Blag 2 (Arruda et al. (1995) J. Biol. Chem. 270: 19563-19568); the cockroach Cr-PI allergens (Wu etai. (1996) 
J. Biol. Chem. 271 : 17937-17943); fire ant venom allergen, Sol i 2 (Schmidt etai. (1996) J. Allergy Clin. Immunol. 98: 
82-88); the insect Chironomus thummi major allergen Chi t 1-9 (Kipp et al. (1996) Int. Arch. Allergy Immunol. 110: 

5 348-353); dog allergen Can f 1 or cat allergen Fel d 1 (Ingram et al. (1995) J. Allergy Clin. Immunol. 96: 449-456); 
albumin, derived, for example, from horse, dog or cat (Goubran Botros et al. (1996) Immunology 88: 340-347); deer 
allergens with the molecular mass of 22 kD, 25 kD or 60 kD (Spitzauer et al. (1997) Clin. Exp. Allergy 27: 196-200); 
and the 20 kd major allergen of cow (Ylonen etai. (1994) J. Allergy Clin. Immunol. 93: 851-858). 
[0075] Pollen and grass allergens are also useful in vaccines, particularly after optimization of the antigen by the 

10 methods of the invention. Such allergens include, for example, Hor v9 (Astwood and Hill (1 996) Gene 1 82: 53-62, Lig 
v1 (Batanero et al. (1996) Clin. Exp. Allergy 26: 1401-1410); Lol p 1 (Muller etai. (1996) Int. Arch. Allergy Immunol. 
109: 352-355), Lol p II (Tamborini etai. (1995) Mol. Immunol. 32: 505-513), Lol pVA, Lol pVB (Ong etai. (1995) Mol. 
Immunol. 32: 295-302), Lol p 9 (Blaher et al. (1996) J. Allergy Clin. Immunol. 98: 124-132); Par J I (Costa et al. (1994) 
FEBS Lett. 341 : 1 82-1 86; Sallusto et al. (1 996) J. Allergy Clin. Immunol. 97: 627-637), Par j 2.01 01 (Duro et al. (1 996) 

15 FEBS Lett. 399:295-298); Betvl (Faber etai. (1996) J. Biol. Chem. 271: 19243-19250), Betv2 (Rihs etai. (1994) int. 
Arch. Allergy Immunol. 105: 190-194); Dac g3 (Guerin-Marchand et al. (1996) Mol. Immunol. 33: 797-806); Phi p 1 
(Petersen etai. (1995) J. Allergy Clin. Immunol. 95: 987-994), Phi p 5 (Muller etai. (1996) int. Arch. Allergy Immunol. 
109: 352-355), Phi p 6 (Petersen et al. (1995) int. Arch. Allergy Immunol. 108: 55-59); Cry j I (Sone et al. (1994) 
Biochem. Biophys. Res. Commun. 199: 619-625), Cry j II (Namba et al. (1994) FEBS Lett. 353: 124-128); Cor a 1 

20 (Schenk et al. (1 994) Eur. J. Biochem. 224: 71 7-722); cyn d1 (Smith etai. (1 996) J. Allergy Clin. Immunol. 98: 331 -343), 
cyn d7 (Suphioglu et al. (1 997) FEBS Lett. 402: 1 67-1 72); Pha a 1 and isoforms of Pha a 5 (Suphioglu and Singh 

(1995) Clin. Exp. Allergy 25: 853-865); Chao 1 (Suzuki et al. (1996) Mol. Immunol. 33: 451-460); profilin derived, e.g, 
from timothy grass or birch pollen (Valenta etai. (1994) Biochem. Biophys. Res. Commun. 199: 106-118); P0149 (Wu 
etai. (1996) Plant Mol. Biol. 32:1037-1042); Ory s1 (Xu etai. (1995) Gene 164: 255-259); and Amb a V and Amb t 5 

25 (Kim etai. (1996) Mol. Immunol. 33: 873-880; Zhu etai. (1995) J. Immunol. 155: 5064-5073). 

[0076] Vaccines against food allergens can also be developed using the methods of the invention. Suitable antigens 
for shuffling include, for example, profilin (Rihs etai. (1994) int. Arch. Allergy Immunol. 105: 190-194); rice allergenic 
cDNAs belonging to the alpha-amylase/trypsin inhibitor gene family (Alvarez etai. (1 995) Biochim Biophys Acta 1251 : 
201-204); the main olive allergen, Ole e I (Lombardero etai. (1994) Clin Exp Allergy 24: 765-770); Sin a 1, the major 

30 allergen from mustard (Gonzalez De La Pena etai. (1996) Eur J Biochem. 237: 827-832); parvalbumin, the major 
allergen of salmon (Lindstrom etai. (1996) Scand.J. Immunol. 44: 335-344); apple allergens, such as the major allergen 
Mai d 1 (Vanek-Krebitz etai. (1995) Biochem. Biophys. Res. Commun. 214: 538-551); and peanut allergens, such as 
Ara h I (Burks etai. (1995) J. Clin. Invest. 96: 1715-1721). 

[0077] The methods of the invention can also be used to develop recombinant antigens that are effective against 
35 allergies to fungi. Fungal allergens useful in these vaccines include, but are not limited to, the allergen, Cla h III, of 
Cladosporium herbarum (Zhang etai. (1995) Immunol. 154: 71 0-71 7); the allergen Psic2, a fungal cyclophilin, from 
the basidiomycete Psilocybe cubensis (Horner et al. (1995) Int. Arch. Allergy Immunol. 107: 298-300); hsp 70 cloned 
from acDNA library of Cladosporium herbarum (Zhang etai. (1996) Clin Exp Allergy 26: 88-95); the 68 kD allergen of 
Penicillium notatum (Shen et al. (1995) Clin. Exp. Allergy 26: 350-356); aldehyde dehydrogenase (ALDH) (Achatz et 
40 al. (1995) Mol Immunol. 32: 213-227); enolase (Achatz etai. (1995) Mol. Immunol. 32: 213-227); YCP4 (Id.); acidic 
ribosomal protein P2 (Id.). 

[0078] Other allergens that can be used in the methods of the invention include latex allergens, such as a major 
allergen (Hev b 5) from natural rubber latex (Akasawa et al. (1996) J. Biol. Chem. 271: 25389-25393; Slater et al. 

(1996) J. Biol. Chem. 271 : 25394-25399). 

45 [0079] The invention also provides a solution to another shortcoming of vaccination as a treatment for allergy and 
asthma. While genetic vaccination primarily induces CD8 + T cell responses, induction of allergen-specific IgE respons- 
es is dependent on CD4 + T cells and their help to B cells. T H 2-type cells are particularly efficient in inducing IgE 
synthesis because they secrete high levels of IL-4, IL-5 and IL-13, which direct Ig isotype switching to IgE synthesis. 
IL-5 also induces eosinophilia. The methods of the invention can be used to develop recombinant antigens that effi- 

50 ciently induce CD4 + T cell responses, and direct differentiation of these cells towards the T H 1 phenotype. 

5. Inflammatory and Autoimmune Diseases 

[0080] Autoimmune diseases are characterized by immune response that attacks tissues or cells of ones own body, 
55 or pathogen-specific immune responses that also are harmful for ones own tissues or cells, or non-specific immune 
activation which is harmful for ones own tissues or cells. Examples of autoimmune diseases include, but are not limited 
to, rheumatoid arthritis, SLE, diabetes mellitus, myastheniagravis, reactive arthritis, ankylosing spondylitis, and multiple 
sclerosis. These and other inflammatory conditions, including IBD, psoriasis, pancreatitis, and various immunodefi- 
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ciencies, can be treated using antigens that are optimized using the methods of the invention. 
[0081] These conditions are often characterized by an accumulation of inflammatory cells, such as lymphocytes, 
macrophages, and neutrophils, at the sites of inflammation. Altered cytokine production levels are often observed, with 
increased levels of cytokine production. Several autoimmune diseases, including diabetes and rheumatoid arthritis, 
are linked to certain MHC haplotypes. Other autoimmune-type disorders, such as reactive arthritis, have been shown 
to be triggered by bacteria such as Yersinia and Shigella, and evidence suggests that several other autoimmune dis- 
eases, such as diabetes, multiple sclerosis, rheumatoid arthritis, may also be initiated by viral or bacterial infections 
in genetically susceptible individuals. 

[0082] Current strategies of treatment generally include anti-inflammatory drugs, such as NSAI D or cyclosporin, and 
antiproliferative drugs, such as methotrexate. These therapies are non-specific, so a need exists for therapies having 
greater specificity, and for means to direct the immune responses towards the direction that inhibits the autoimmune 
process. 

[0083] The present invention provides several strategies by which these needs can be fulfilled. First, the invention 
provides methods of obtaining antigens having greater tolerogenicity and/or have improved antigenicity. In a preferred 
embodiment, the antigens prepared according to the invention exhibit improved induction of tolerance by oral delivery. 
Oral tolerance is characterized by induction of immunological tolerance after oral administration of large quantities of 
antigen. In animal models, this approach has proven to be a very promising approach to treat autoimmune diseases, 
and clinical trials are in progress to address the efficacy of this approach in the treatment of human autoimmune 
diseases, such as rheumatoid arthritis and multiple sclerosis. It has also been suggested that induction of oral tolerance 
against viruses used in gene therapy might reduce the immunogenicity of gene therapy vectors. However, the amounts 
of antigen required for induction of oral tolerance are very high and the methods of the invention provide a means for 
obtaining antigens that exhibit a significant improvement in induction of oral tolerance. 

[0084] Expression library immunization (Barry etal, (1 995) Nature 377: 632) is a particularly useful method of screen- 
ing for optimal antigens for use in genetic vaccines. For example, to identify autoantigens present in Yersinia, Shigella, 
and the like, one can screen for induction of T cell responses in HLA-B27 positive individuals. Complexes that include 
epitopes of bacterial antigens and MHC molecules associated with autoimmune diseases, e.g., HLA-B27 in association 
with Yersinia antigens can be used in the prevention of reactive arthritis and ankylosing spondylitis in HLA-B27 positive 
individuals. 

[0085] Screening of optimized antigens can be done in animal models which are known to those of skill in the art. 
Examples of suitable models for various conditions include collagen induced arthritis, the NFS/sId mouse model of 
human Sjogren's syndrome; a 1 20 kD organ-specific autoantigen recently identified as an analog of human cytoskeletal 
protein (a-fodrin (Haneji etal. (1997) Science 276: 604), the New Zealand Black/White F1 hybrid mouse model of 
human SLE, NOD mice, a mouse model of human diabetes mellitus, fas/fas ligand mutant mice, which spontaneously 
develop autoimmune and lymphoproliferative disorders (Watanabe-Fukunaga et ai. (1992) Nature 356: 314), and ex- 
perimental autoimmune encephalomyelitis (EAE), in which myelin basic protein induces a disease that resembles 
human multiple sclerosis. 

[0086] Autoantigens that can be shuffled according to the methods of the invention and used in vaccines for treating 
multiple sclerosis include, but are not limited to, myelin basic protein (Stinissen et ai. (1996) J. Neurosci. Res. 45: 
500-511) or a fusion protein of myelin basic protein and proteolipid protein (Elliott et ai. (1996) J. Clin. Invest. 98: 
1602-1612), proteolipid protein (PLP) (Rosener etal. (1997) J. Neuroimmunol. 75: 28-34), 2',3'-cyclic nucleotide 3'- 
phosphodiesterase (CNPase) (Rosener et ai. (1997) J. Neuroimmunol. 75: 28-34), the Epstein Barr virus nuclear an- 
tigen-1 (EBNA-1) (Vaughan etal. (1996) J. Neuroimmunol. 69: 95-102), HSP70 (Salvetti etal. (1996) J. Neuroimmunol. 
65: 143-53; Feldmann etal. (1996) Ce//85: 307). 

[0087] Target antigens that, after shuffling according to the methods of the invention, can be used to treat scleroder- 
ma, systemic sclerosis, and systemic lupus erythematosus include, for example, (-2-GPI, 50 kDa glycoprotein (Blank 
et ai. (1994) J. Autoimmun. 7: 441-455), Ku (p70/p80) autoantigen, or its 80-kd subunit protein (Hong et ai. (1994) 
Invest. Ophthalmol. Vis. Sci. 35: 4023-4030; Wang et ai (1 994) J. Cell Sci. 1 07: 3223-3233), the nuclear autoantigens 
La (SS-B) and Ro (SS-A) (Huang etal. (1997) J. Clin. Immunol. 77:212-219; Igarashi etal. (1995) Autoimmunity 22\ 
33-42; Keech etal. (1996) Clin. Exp. Immunol, 104: 255-263; Manoussakis etal. (1995) J. Autoimmun. 8: 959-969; 
Topfer etal. (1995) Proc. Nat 7. Acad. Sci. USA 92: 875-879), proteasome (-type subunit C9 (Feist etal. (1996) J. Exp. 
Med. 184: 1313-1318), Scleroderma antigens Rpp30, Rpp 38 or Scl-70 (Eder etal. (1997) Proc. Nat'l. Acad. Sci. USA 
94: 1101-1106; Hietarinta etal. (1994) Br. J. Rheumatol. 33: 323-326), the centrosome autoantigen PCM-1 (Bao etal. 
(1995) Autoimmunity 22: 219-228), polymyositis-scleroderma autoantigen (PM-Scl) (Kho etal. (1997) J. Biol. Chem. 
272: 13426-13431), scleroderma (and other systemic autoimmune disease) autoantigen CENP-A (Muro etal. (1996) 
Clin. Immunol. Immunopathol. 78: 86-89), U5, a small nuclear ribonucleoprotein (snRNP) (Okano et ai. (1996) Clin. 
Immunol. Immunopathol. 81 : 41-47), the 100-kd protein of PM-Scl autoantigen (Ge etal. (1996) Arthritis Rheum. 39: 
1588-1595), the nucleolar U3- and Th(7-2) ribonucleoproteins (Verheijen et ai. (1994) J. Immunol. Methods 169: 
173-182), the ribosomal protein L7 (Neu etal. (1995) Clin. Exp. Immunol. 100: 198-204), hPopI (Lygerou etal. (1996) 
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EMBO J. 15: 5936-5948), and a 36-kd protein from nuclear matrix antigen (Deng et al. (1996) Arthritis Rheum. 39: 
1300-1307). 

[0088] Hepatic autoimmune disorders can also be treated using improved recombinant antigens that are prepared 
according to the methods described herein. Among the antigens that are useful in such treatments are the cytochromes 
P450 and UDP-glucuronosyl-transferases (Obermayer-Straub and Manns (1996) BaiHieres Clin. Gastroenterol. 10: 
501-532), the cytochromes P450 2C9 and P450 1 A2 (Bourdi etal. (1996) Chem. Res. Toxicol. 9: 1159-1166; Clemente 
et al. (1997) J. Clin. Endocrinol. Metab. 82: 1353-1361), LC-1 antigen (Klein etal. (1996) J. Pediatr. Gastroenterol. 
Nutr. 23: 461 -465), and a 230-kDa Golgi-associated protein (Funaki et al. (1 996) Cell Struct. Funct. 21 : 63-72). 
[0089] For treatment of autoimmune disorders of the skin, useful antigens include, but are not limited to, the 450 kD 
human epidermal autoantigen (Fujiwara et al. (1996) J. Invest. Dermatol. 106: 1125-1130), the 230 kD and 180 kD 
bullous pemphigoid antigens (Hashimoto (1995) Keio J. Med. 44: 115-123; Murakami et al. (1996) J. Dermatol. Sci. 
13: 112-117), pemphigus foliaceus antigen (desmoglein 1), pemphigus vulgaris antigen (desmoglein3), BPAg2, BPAgl , 
and type VII collagen (Batteux etal. (1997) J. Clin. Immunol. 17: 228-233; Hashimoto etal. (1996) J. Dermatol. Sci. 
12:10-17), a 168-kDa mucosal antigen in a subset of patients with cicatricial pemphigoid (Ghohestani et al. (1996) J. 
Invest. Dermatol. 107: 136-139), and a218-kd nuclear protein (218-kd Mi-2) (Seelig etal. (1995) Arthritis Rheum. 38: 
1389-1399). 

[0090] The methods of the invention are also useful for obtaining improved antigens for treating insulin dependent 
diabetes mellitus, using one or more of antigens which include, but are not limited to, insulin, proinsulin, GAD65 and 
GAD67, heat-shock protein 65 (hsp65), and islet-cell antigen 69 (ICA69) (French et al. (1997) Diabetes 46: 34-39; 
Roep (1996) Diabetes 45: 1147-1156; Schloot etal. (1997) Diabetologia 40: 332-338), viral proteins homologous to 
GAD65 (Jones and Crosby (1 996) Diabetologia 39: 1 31 8-1 324), islet cell antigen-related protein-tyrosine phosphatase 
(PTP) (Cui etal. (1996) J. Biol. Chem. 271: 24817-24823), GM2-1 ganglioside (Cavallo etal. (1996) J. Endocrinol. 
150: 113-120; Dotta etal. (1996) Diabetes 45: 1193-1196), glutamic acid decarboxylase (GAD) (Nepom (1995) Curr. 
Opin. Immunol. 7: 825-830; Panina-Bordignon etal. (1995) J. Exp. Med. 181 : 1923-1927), an islet cell antigen (ICA69) 
(Karges etal. (1997) Biochim. Biophys. Acta 1360: 97-101 ; Roep etal. (1996) Eur. J. Immunol. 26: 1285-1289), Tep69, 
the single T cell epitope recognized by T cells from diabetes patients (Karges et al. (1997) Biochim. Biophys. Acta 
1 360: 97-1 01 ), ICA 51 2, an autoantigen of type I diabetes (Solimena et al. (1 996) EMBO J. 15: 21 02-21 1 4), an islet- 
cell protein tyrosine phosphatase and the 37-kDa autoantigen derived from it in type 1 diabetes (including IA-2, IA-2) 
(La Gasse etal. (1997) Mol. Med. 3: 163-173), the 64 kDa protein from ln-111 cells or human thyroid follicular cells 
that is immunoprecipitated with sera from patients with islet cell surface antibodies (ICSA) (Igawaefa/. (1996) Endocr. 
J. 43: 299-306), phogrin, a homologue of the human transmembrane protein tyrosine phosphatase, an autoantigen of 
type 1 diabetes (Kawasaki et al. (1996) Biochem. Biophys. Res. Commun. 227: 440-447), the 40 kDa and 37 kDa 
tryptic fragments and their precursors IA-2 and IA-2 in IDDM (Lampasona et al. (1996) J. Immunol. 157: 2707-2711 ; 
Notkins et al. (1996) J. Autoimmun. 9: 677-682), insulin or a cholera toxoid-insulin conjugate (Bergerot et al. (1997) 
Proc. Natl. Acad. Sci. USA 94: 4610-4614), carboxypeptidase H, the human homologue of gp330, which is a renal 
epithelial glycoprotein involved in inducing Heymann nephritis in rats, and the 38-kD islet mitochondrial autoantigen 
(Arden etal. (1996) J. Clin. Invest. 97: 551-561. 

[0091 ] Rheumatoid arthritis is another condition that is treatable using optimized antigens prepared according to the 
present invention. Useful antigens for rheumatoid arthritis treatment include, but are not limited to, the 45 kDa DEK 
nuclear antigen, in particular onset juvenile rheumatoid arthritis and iridocyclitis (Murray et al. (1997) J. Rheumatol. 
24: 560-567), human cartilage glycoprotein-39, an autoantigen in rheumatoid arthritis (Verheijden etal. (1997) Arthritis 
Rheum. 40: 1115-1125), a 68k autoantigen in rheumatoid arthritis (Blass etal. (1997) Ann. Rheum. Dis. 56: 317-322), 
collagen (Rosloniec etal. (1995) J. Immunol. 155: 4504-4511), collagen type II (Cook et al. (1996) Arthritis Rheum. 
39: 1720-1727; Trentham (1996) Ann. N. Y.Acad. Sci. 778: 306-314), cartilage link protein (Guerassimov etal. (1997) 
J. Rheumatol. 24: 959-964), ezrin, radixin and moesin, which are auto-immune antigens in rheumatoid arthritis (Wagat- 
suma etal. (1996) Mol. Immunol. 33: 1171-1176), and mycobacterial heat shock protein 65 (Rag no etal, (1997) Arthritis 
Rheum. 40: 277-283). 

[0092] Also among the conditions for which one can obtain an improved antigen suitable for treatment are autoim- 
mune thyroid disorders. Antigens that are useful for these applications include, for example, thyroid peroxidase and 
the thyroid stimulating hormone receptor (Tandon and Weetman (1 994) J. R. Coll. Physicians Lond. 28: 1 0-1 8), thyroid 
peroxidase from human Graves' thyroid tissue (Gardas etal. (1997) Biochem. Biophys. Res. Commun. 234: 366-370; 
Zimmer et al. (1997) Histochem. Cell. Biol. 107: 115-120), a 64-kDa antigen associated with thyroid-associated oph- 
thalmopathy (Zhang etal. (1996) Clin. Immunol. Immunopathol. 80: 236-244), the human TSH receptor (Nicholson et 
al. (1996) J. Mol. Endocrinol. 16: 159-170), and the 64 kDa protein from ln-111 cells or human thyroid follicular cells 
that is immunoprecipitated with sera from patients with islet cell surface antibodies (ICSA) (Igawaefa/. (1996) Endocr. 
J. 43: 299-306). 

[0093] Other conditions and associated antigens include, but are not limited to, Sjogren's syndrome (-fodrin; Haneji 
et al. (1997) Science 276: 604-607), myastenia gravis (the human M2 acetylcholine receptor or fragments thereof, 
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specifically the second extracellular loop of the human M2 acetylcholine receptor; Fu et al. (1996) Clin. Immunol. 
Immunopathol. 78: 203-207), vitiligo (tyrosinase; Fishman etal. (1997) Cancer 79: 1461-1464), a 450 kD human epi- 
dermal autoantigen recognized by serum from individual with blistering skin disease, and ulcerative colitis (chromo- 
somal proteins HMG1 and HMG2; Sobajima etal. (1997) Clin. Exp. Immunol. 107: 135-140). 

5 

6. Cancer 

[0094] Immunotherapy has great promise for the treatment of cancer and prevention of metastasis. By inducing an 
immune response against cancerous cells, the body's immune system can be enlisted to reduce or eliminate cancer. 
10 Improved antigens obtained using the methods of the invention provide cancer immunotherapies of increased effec- 
tiveness compared to those that are presently available. 

[0095] One approach to cancer immunotherapy is vaccination using vaccines that include or encode antigens that 
are specific for tumor cells or by injecting the patients with purified recombinant cancer antigens. The methods of the 
invention can be used for obtaining antigens that exhibit an enhancement of immune responses against known tumor- 

15 specific antigens, and also to search for novel protective antigenic sequences. Antigens having optimized expression, 
processing, and presentation can be obtained as described herein. The approach used for each particular cancer can 
vary. For treatment of hormone-sensitive cancers (for example, breast cancer and prostate cancer), methods of the 
invention can be used to obtain optimized hormone antagonists. For highly immunogenic tumors, including melanoma, 
one can screen for recombinant antigens that optimally boost the immune response against the tumor. Breast cancer, 

20 in contrast, is of relatively low immunogenicity and exhibits slow progression, so individual treatments can be designed 
for each patient. Prevention of metastasis is also a goal in design of cancer vaccines. 

[0096] Among the tumor-specific antigens that can be used in the antigen shuffling methods of the invention are: 
bullous pemphigoid antigen 2, prostate mucin antigen (PMA) (Beckett and Wright (1995) Int. J Cancer 62: 703-710), 
tumor associated Thomsen-Friedenreich antigen (Dahlenborg et al. (1997) Int. J. Cancer 70: 63-71), prostate-specific 

25 antigen (PSA) (Dannull and Belldegrun (1997) Br. J Urol. 1: 97-103), luminal epithelial antigen (LEA. 135) of breast 
carcinoma and bladder transitional cell carcinoma (TCC) (Jones et al. (1997) Anticancer Res. 17: 685-687), cancer- 
associated serum antigen (CASA) and cancer antigen 125 (CA 125) (Kierkegaard et al. (1995) Gynecol. Oncol. 59: 
251-254), the epithelial glycoprotein 40 (EGP40) (Kievit etal. (1997) Int. J. Cancer 71 : 237-245), squamous cell car- 
cinoma antigen (SCC) (Lozzaefa/. (1 997) An ticancer Res. 17: 525-529), cathepsin E (Mota etal. (1997) Am. J. Pathol. 

30 1 50: 1223-1229), tyrosinase in melanoma (Fishman etal. (1997) Cancer79: 1461-1464), cell nuclear antigen (PCNA) 
of cerebral cavernomas (Natelet etal. (1997) Surg. Neurol. 47: 364-370), DF3/MUC1 breast cancer antigen (Apostol- 
opoulos etal. (1996) Immunol. Cell. Biol. 74: 457-464; Pandey etal. (1995) Cancer Res. 55: 4000-4003), carcinoem- 
bryonic antigen (Paone etal. (1996) J. Cancer Res. Clin. Oncol. 122: 499-503; Schlom etal. (1996) Breast Cancer 
Res. Treat. 38: 27-39), tumor-associated antigen CA 1 9-9 (Tolliver and O'Brien (1 997) South Med. J. 90: 89-90; Tsuruta 

35 etal. (1997) Urol. Int. 58: 20-24), human melanoma antigens MART-l/Melan-A27-35 andgp 100 (Kawakamiand Rosen- 
berg (1997) Int. Rev. Immunol. 14: 173-192; Zajacetal. (1997) Int. J. Cancerl\ : 491-496), the T and Tn pancarcinoma 
(CA) glycopeptide epitopes (Springer (1995) Crit. Rev. Oncog. 6: 57-85), a 35 kD tumor-associated autoantigen in 
papillary thyroid carcinoma (Lucas et al. (1996) Anticancer Res. 16:2493-2496), KH-1 adenocarcinoma antigen (Desh- 
pande and Danishefsky (1997) Nature 387: 164-166), the A60 mycobacterial antigen (Maes et al. (1996) J. Cancer 

40 Res. Clin. Oncol. 122: 296-300), heat shock proteins (HSPs) (Blachere and Srivastava (1995) Semin. Cancer Biol. 6: 
349-355), and MAGE, tyrosinase, melan-A and gp75 and mutant oncogene products (e.g., p53, ras, and HER-2/neu 
(Bueler and Mulligan (1996) Mol. Med. 2: 545-555; Lewis and Houghton (1995) Semin. Cancer Biol. 6: 321-327; The- 
obald etal. (1995) Proc. Nat'l. Acad. Sci. USA92: 11993-11997). 

45 7. Contraception 

[0097] Genetic vaccines that contain optimized antigens obtained by the methods of the invention are also useful 
for contraception. For example, genetic vaccines can be obtained that encode sperm cell specific antigens, and thus 
induce anti-sperm immune responses. Vaccination can be achieved by, for example, administration of recombinant 
50 bacterial strains, e.g. Salmonella and the like, which express sperm antigen, as well as by induction of neutralizing 
anti-hCG antibodies by vaccination by DNA vaccines encoding human chorionic gonadotropin (hCG), or a fragment 
thereof. 

[0098] Sperm antigens which can be used in the genetic vaccines include, for example, lactate dehydrogenase 
(LDH-C4), galactosyltransferase (GT), SP-10, rabbit sperm autoantigen (RSA), guinea pig (g)PH-20, cleavage signal 
55 protein (CS-1), HSA-63, human (h)PH-20, and AgX-1 (Zhu and Naz (1994) Arch. Androl. 33: 141-144), the synthetic 
sperm peptide, P10G (O'R and etal. (1993) J. Reprod. Immunol. 25: 89-102), the 135kD, 95kD, 65kD, 47kD, 41 kD 
and 23kD proteins of sperm, and the FA-1 antigen (Naz etal. (1 995) Arch. Androl. 35: 225-231 ), and the 35 kD fragment 
of cytokeratin 1 (Lucas etal. (1996) Anticancer Res. 16: 2493-2496). 
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[0099] The methods of the invention can also be used to obtain genetic vaccines that are expressed specifically in 
testis. For example, polynucleotide sequences that direct expression of genes that are specific to testis can be used 
(e.g., fertilization antigen-1 and the like). In addition to sperm antigens, antigens expressed on oocytes or hormones 
regulating reproduction may be useful targets of contraceptive vaccines. For example, genetic vaccines can be used 
5 to generate antibodies against gonadotropin releasing hormone (GnRH) or zona pellucida proteins (Miller et al. (1 997) 
Vaccine 15:1858-1862). Vaccinations using these molecules have been shown to be efficacious in animal models 
(Miller et al. (1 997) Vaccine 15:1 858-1 862). Another example of a useful component of a genetic contraceptive vaccine 
is the ovarian zona pellucida glycoprotein ZP3 (Tung et al. (1994) Reprod. Fertil. Dev. 6:349-355). 

1 o Methods of Selecting and Identifying Optimized Recombinant Antigens 

[0100] Once one has performed DNA shuffling to obtain a library of polynucleotides that encode recombinant anti- 
gens, the library is subjected to selection and/or screening to identify those library members that encode antigenic 
peptides that have improved ability to induce an immune response to the pathogenic agent. Selection and screening 

15 of recombinant polynucleotides that encode polypeptides having an improved ability to induce an immune response 
can involve either in vivo and in vitro methods, but most often involves a combination of these methods. For example, 
in a typical embodiment the members of a library of recombinant nucleic acids are picked, either individually or as 
pools. The clones can be subjected to analysis directly, or can be expressed to produce the corresponding polypeptides. 
In a presently preferred embodiment, an in vitro screen is performed to identify the best candidate sequences for the 

20 in vivo studies. Alternatively, the library can be subjected to in vivo challenge studies directly. The analyses can employ 
either the nucleic acids themselves (e.g., as genetic vaccines), or the polypeptides encoded by the nucleic acids. A 
schematic diagram of a typical strategy is shown in Figure 5. Both in vitro and in vivo methods are described in more 
detail below. 

[0101] If a recombination cycle is performed in vitro, the products of recombination, i.e., recombinant segments, are 
25 sometimes introduced into cells before the screening step. Recombinant segments can also be linked to an appropriate 
vector or other regulatory sequences before screening. Alternatively, products of recombination generated in vitro are 
sometimes packaged in viruses (e.g., bacteriophage) before screening. If recombination is performed in vivo, recom- 
bination products can sometimes be screened in the cells in which recombination occurred. In other applications, 
recombinant segments are extracted from the cells, and optionally packaged as viruses, before screening. 
30 [0102] Often, improvements are achieved after one round of recombination and selection. However, recursive se- 
quence recombination can also be employed to achieve still further improvements in a desired property, or to bring 
about new (or "distinct") properties. Recursive sequence recombination entails successive cycles of recombination to 
generate molecular diversity. That is, one creates a family of nucleic acid molecules showing some sequence identity 
to each other but differing in the presence of mutations. In any given cycle, recombination can occur in vivo or in vitro, 
35 intracellular^ or extracellularly. Furthermore, diversity resulting from recombination can be augmented in any cycle by 
applying prior methods of mutagenesis (e.g., error-prone PCR or cassette mutagenesis) to either the substrates or 
products for recombination. 

[0103] In a presently preferred embodiment, polynucleotides that encode optimized recombinant antigens are sub- 
jected to molecular backcrossing, which provides a means to breed the shuffled chimeras/mutants back to a parental 

40 or wild-type sequence, while retaining the mutations that are critical to the phenotype that provides the optimized 
immune responses. In addition to removing the neutral mutations, molecular backcrossing can also be used to char- 
acterize which of the many mutations in an improved variant contribute most to the improved phenotype. This cannot 
be accomplished in an efficient library fashion by any other method. Backcrossing is performed by shuffling the improved 
sequence with a large molar excess of the parental sequences. 

45 [0104] The nature of screening or selection depends on what property or characteristic is to be acquired or the 
property or characteristic for which improvement is sought, and many examples are discussed below. It is not usually 
necessary to understand the molecular basis by which particular products of recombination (recombinant segments) 
have acquired new or improved properties or characteristics relative to the starting substrates. For example, a gene 
that encodes an antigenic polypeptide can have many component sequences each having a different intended role 

50 (see, e.g., Figure 4). Each of these component sequences can be varied and recombined simultaneously. Screening/ 
selection can then be performed, for example, for recombinant segments that have increased ability to induce an 
immune response to a pathogenic agent without the need to attribute such improvement to any of the individual com- 
ponent sequences of the recombinant polynucleotide. 

[0105] Depending on the particular screening protocol used for a desired property, initial round(s) of screening can 
55 sometimes be performed using bacterial cells due to high transfection efficiencies and ease of culture. However, es- 
pecially for testing of immunogenic activity, test animals are used for library expression and screening. Similarly other 
types of screening which are not amenable to screening in bacterial or simple eukaryotic library cells, are performed 
in cells selected for use in an environment close to thatof their intended use. Final rounds of screening can be performed 
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in cells or organisms that are as close as possible to the precise cell type or organism of intended use. 
[0106] If further improvement in a property is desired, at least one, and usually a collection, of recombinant segments 
surviving a first round of screening/selection are subject to a further round of recombination. These recombinant seg- 
ments can be recombined with each other or with exogenous segments representing the original substrates or further 
variants thereof. Again, recombination can proceed in vitro or in vivo. If the previous screening step identifies desired 
recombinant segments as components of cells, the components can be subjected to further recombination in vivo, or 
can be subjected to further recombination in vitro, or can be isolated before performing a round of in vitro recombination. 
Conversely, if the previous screening step identifies desired recombinant segments in naked form or as components 
of viruses, these segments can be introduced into cells to perform a round of in vivo recombination. The second round 
of recombination, irrespective how performed, generates further recombinant segments which encompass additional 
diversity than is present in recombinant segments resulting from previous rounds. 

[0107] The second round of recombination can be followed by a further round of screening/selection according to 
the principles discussed above for the first round. The stringency of screening/selection can be increased between 
rounds. Also, the nature of the screen and the property being screened for can vary between rounds if improvement 
in more than one property is desired or if acquiring more than one new property is desired. Additional rounds of re- 
combination and screening can then be performed until the recombinant segments have sufficiently evolved to acquire 
the desired new or improved property or function. 

[0108] The practice of this invention involves the construction of recombinant nucleic acids and the expression of 
genes in transfected host cells. Molecular cloning techniques to achieve these ends are known in the art. A wide variety 
of cloning and in vitro amplification methods suitable for the construction of recombinant nucleic acids such as expres- 
sion vectors are well-known to persons of skill. General texts which describe molecular biological techniques useful 
herein, including mutagenesis, include Berger and Kimmel, Guide to Molecular Cloning Techniques, Methods in En- 
zymo/ogy volume 152 Academic Press, Inc., San Diego, CA (Berger); Sambrook ef a/., Molecular Cloning - A Laboratory 
Manual (2nd Ed.), Vol. 1-3, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, 1989 ("Sambrook") and 
Current Protocols in Molecular Biology, F.M. Ausubel et al., eds., Current Protocols, a joint venture between Greene 
Publishing Associates, Inc. and John Wiley & Sons, Inc., (supplemented through 1998) ("Ausubel")). Examples of 
techniques sufficient to direct persons of skill through in vitro amplification methods, including the polymerase chain 
reaction (PCR) the ligase chain reaction (LCR), Q - replicase amplification and other RNA polymerase mediated tech- 
niques (e.g., NASBA) are found in Berger, Sambrook, and Ausubel, as well as Mullis et at. (1987) U.S. Patent No. 
4,683,202; PCR Protocols A Guide to Methods and Applications (Innis et at. eds) Academic Press Inc. San Diego, CA 
(1990) (Innis); Amheim & Levinson (October 1, 1990) C&EN 36-47; The Journal Of NIH Research (1991) 3, 81-94; 
(Kwoh etal. (1989) Proc. Natl. Acad. Sci. USA 86, 1173; Guatelli et at. (1990) Proc. Natl. Acad. Sci. USA 87, 1874; 
Lomell etal. (1989) J. Clin. Chem 35, 1826; Landegren etal. (1988) Science 241, 1077-1080; Van Brunt (1990) Bio- 
technology 8, 291-294; Wu and Wallace (1989) Gene 4, 560; Barringer et at. (1990) Gene 89, 117, and Sooknanan 
and Malek( 1995) Biotechnology 13: 563-564. Improved methods of cloning in vitro amplified nucleic acids are described 
in Wallace etal., U.S. Pat. No. 5,426,039. Improved methods of amplifying large nucleic acids by PCR are summarized 
in Cheng et al. (1994) Nature 369: 684-685 and the references therein, in which PCR amplicons of up to 40kb are 
generated. One of skill will appreciate that essentially any RNA can be converted into a double stranded DNA suitable 
for restriction digestion, PCR expansion and sequencing using reverse transcriptase and a polymerase. See, Ausubel, 
Sambrook and Berger, all supra. 

[0109] Oligonucleotides for use as probes, e.g., in in vitro amplification methods, for use as gene probes, or as 
shuffling targets (e.g., synthetic genes or gene segments) are typically synthesized chemically according to the solid 
phase phosphoramidite triester method described by Beaucage and Caruthers (1981) Tetrahedron Letts., 22(20): 
1859-1862, e.g., using an automated synthesizer, as described in Needham-VanDevanter et al. (1984) Nucleic Acids 
Res., 12:6159-6168. Oligonucleotides can also be custom made and ordered from a variety of commercial sources 
known to persons of skill. 

[0110] Indeed, essentially any nucleic acid with a known sequence can be custom ordered from any of a variety of 
commercial sources, such as The Midland Certified Reagent Company (mcrc@oligos.com), The Great American Gene 
Company (http://www.genco.com), ExpressGen Inc. (www.expressgen.com), Operon Technoloigies Inc. (Alameda, 
CA) and many others. Similarly, peptides and antibodies can be custom ordered from any of a variety of sources, such 
as PeptidoGenic (pkim@ccnet.com), HTI Bio-products, Inc. (http://www.htibio.com), BMA Biomedicals Ltd (U.K.), Bio 
Synthesis, Inc., and many others. 

1 . Purification and in vitro analysis of recombinant nucleic acids and polypeptides 

[0111] Once DNA shuffling has been performed, the resulting library of recombinant polynucleotides can be subjected 
to purification and preliminary analysis in vitro, in order to identify the most promising candidate recombinant nucleic 
acids. Advantageously, the assays can be practiced in a high-throughput format. For example, to purify individual 
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shuffled recombinant antigens, clones can robotically picked into 96-well formats, grown, and, if desired, frozen for 
storage. 

[0112] Whole cell lysates (V-antigen), periplasmic extracts, or culture supernatants (toxins) can be assayed directly 
by ELISA as described below, but high throughput purification is sometimes also needed. Affinity chromatography 
using immobilized antibodies or incorporation of a small nonimmunogenic affinity tag such as a hexahistidine peptide 
with immobilized metal affinity chromatography will allow rapid protein purification. High binding-capacity reagents with 
96-well filter bottom plates provide a high throughput purification process. The scale of culture and purification will 
depend on protein yield, but initial studies will require less than 50 micrograms of protein. Antigens showing improved 
properties can be purified in larger scale by FPLC for re-assay and animal challenge studies. 

[0113] In some embodiments, the shuffled antigen-encoding polynucleotides are assayed as genetic vaccines. Ge- 
netic vaccine vectors containing the shuffled antigen sequences can be prepared using robotic colony picking and 
subsequent robotic plasmid purification. Robotic plasmid purification protocols are available that allow purification of 
600-800 plasmids per day. The quantity and purity of the DNA can also be analyzed in 96-well plates, for example. In 
a presently preferred embodiment, the amount of DNA in each sample is robotically normalized, which can significantly 
reduce the variation between different batches of vectors. 

[0114] Once the proteins and/or nucleic acids are picked and purified as desired, they can be subjected to any of a 
number of in vitro analysis methods. Such screenings include, for example, phage display, flow cytometry, and ELISA 
assays to identify antigens that are efficiently expressed and have multiple epitopes and a proper folding pattern. In 
the case of bacterial toxins, the libraries may also be screened for reduced toxicity in mammalian cells. 
[01 1 5] As one example, to identify recombinant antigens that are cross-reactive, one can use a panel of monoclonal 
antibodies for screening. A humoral immune response generally targets multiple regions of antigenic proteins. Accord- 
ingly, monoclonal antibodies can be raised against various regions of immunogenic proteins (Alving et ai. (1995) Im- 
munol. Rev. 145: 5). In addition, there are several examples of monoclonal antibodies that only recognize one strain 
of a given pathogen, and by definition, different serotypes of pathogens are recognized by different sets of antibodies. 
For example, a panel of monoclonal antibodies have been raised against VEE envelope proteins, thus providing a 
means to recognize different subtypes of the virus (Roehrig and Bolin (1997) J. Clin, Microbiol. 35: 1887). Such anti- 
bodies, combined with phage display and ELISA screening, can be used to enrich recombinant antigens that have 
epitopes from multiple pathogen strains. Flow cytometry based cell sorting will further allow for the selection of variants 
that are most efficiently expressed. 

[0116] Phage display provides a powerful method for selecting proteins of interest from large libraries (Bass et at. 
(1990) Proteins: Struct. Funct. Genet. 8: 309; Lowman and Wells (1991) Methods: A Companion to Methods Enz. 3 
(3);205-216. Lowman and Wells (1993) J. Mol. Biol. 234;564-578). Some recent reviews on the phage display technique 
include, for example, McGregor (1996) Mol Biotechnol. 6(2):1 55-62; Dunn (1996) Curr. Opin. Biotechnol. 7(5):547-53; 
Hill etal. (1996) Mol Microbiol 20(4) :6 85-92; Phage Display of Peptides and Proteins: A Laboratory Manual. BK. Kay, 
J. Winter, J, McCafferty eds., Academic Press 1996; O'Neil etal. (1995) Curr. Opin. Struct. Biol. 5(4):443-9; Phizicky 
etal. (1995) Microbiol Rev. 59(1 ):94-123; Clackson etal. (1994) Trends Biotechnol. 12(5):1 73-84; Felici etal. (1995) 
Biotechnol. Annu. Rev. 1:149-83; Burton (1995) Immunotechnology 1(2):87-94.) See,also, Cwirla etal., Proc. Natl. 
Acad. Sci. USA 87: 6378-6382 (1 990); Devlin etal., Science 249: 404-406 (1 990), Scott & Smith, Science 249: 386-388 
(1 990); Ladner etal., US 5,571 ,698. Each phage particle displays a unique variant protein on its surface and packages 
the gene encoding that particular variant. The shuffled genes for the antigens are fused to a protein that is expressed 
on the phage surface, e.g., gene III of phage M13, and cloned into phagemid vectors. In a presently preferred embod- 
iment, a suppressible stop codon (e.g., an amber stop codon) separates the genes so that in a suppressing strain of 
E. coli, the antigen-glllp fusion is produced and becomes incorporated into phage particles upon infection with M13 
helper phage. The same vector can direct production of the unfused antigen alone in a nonsuppressing E. coli for 
protein purification. 

[0117] The genetic packages most frequently used for display libraries are bacteriophage, particularly filamentous 
phage, and especially phage M13, Fd and Fl. Most work has involved inserting libraries encoding polypeptides to be 
displayed into either gill or gVIII of these phage forming a fusion protein. See, e.g., Dower, WO 91/1 981 8; Devlin, WO 
91/18989; MacCafferty, WO 92/01047 (gene III); Huse, WO 92/06204; Kang, WO 92/1 861 9 (gene VIII). Such a fusion 
protein comprises a signal sequence, usually but not necessarily, from the phage coat protein, a polypeptide to be 
displayed and either the gene III or gene VIII protein or a fragment thereof. Exogenous coding sequences are often 
inserted at or near the N-terminus of gene III or gene VIII although other insertion sites are possible. 
[0118] Eukaryotic viruses can be used to display polypeptides in an analogous manner. For example, display of 
human heregulin fused to gp70 of Moloney murine leukemia virus has been reported by Han et ai, Proc. Natl. Acad. 
Sci. USA 92: 9747-9751 (1995). Spores can also be used as replicable genetic packages. In this case, polypeptides 
are displayed from the outer surface of the spore. For example, spores from B. subtilis have been reported to be 
suitable. Sequences of coat proteins of these spores are provided by Donovan et al., J. Mol. Biol. 196, 1-10 (1987). 
Cells can also be used as replicable genetic packages. Polypeptides to be displayed are inserted into a gene encoding 
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a cell protein that is expressed on the cells surface. Bacterial cells including Salmonella typhimuhum, Bacillus subtilis, 
Pseudomonas aeruginosa, Vibrio choierae, Klebsiella pneumonia, Neisseria gonorrhoeae, Neisseria meningitidis, 
Bacteroides nodosus, Moraxella bovis, and especially Escherichia coli are preferred. Details of outer surface proteins 
are discussed by Ladner et al., US 5,571 ,698 and references cited therein. For example, the lamB protein of E. coli is 
suitable. 

[01 19] A basic concept of display methods that use phage or other replicable genetic package is the establishment 
of a physical association between DNA encoding a polypeptide to be screened and the polypeptide. This physical 
association is provided by the replicable genetic package, which displays a polypeptide as part of a capsid enclosing 
the genome of the phage or other package, wherein the polypeptide is encoded by the genome. The stablishment of 
a physical association between polypeptides and their genetic material allows simultaneous mass screening of very 
large numbers of phage bearing different polypeptides. Phage displaying a polypeptide with affinity to a target, e.g., a 
receptor, bind to the target and these phage are enriched by affinity screening to the target. The identity of polypeptides 
displayed from these phage can be determined from their respective genomes. Using these methods a polypeptide 
identified as having a binding affinity for a desired target can then be synthesized in bulk by conventional means, or 
the polynucleotide that encodes the peptide or polypeptide can be used as part of a genetic vaccine. 
[0120] Variants with specific binding properties, in this case binding to family-specific antibodies, are easily enriched 
by panning with immobilized antibodies. Antibodies specific for a single family are used in each round of panning to 
rapidly select variants that have multiple epitopes from the antigen families. For example, A-family specific antibodies 
can be used to select those shuffled clones that display A-specific epitopes in the first round of panning. A second 
round of panning with B-specific antibodies will select from the "A" clones those that display both A- and B-specific 
epitopes. A third round of panning with C-specific antibodies will select for variants with A, B, and C epitopes. A continual 
selection exists during this process for clones that express well in E. coli and that are stable throughout the selection. 
Improvements in factors such as transcription, translation, secretion, folding and stability are often observed and will 
enhance the utility of selected clones for use in vaccine production. 

[0121] Phage ELISA methods can be used to rapidly characterize individual variants. These assays provide a rapid 
method for quantitation of variants without requiring purification of each protein. Individual clones are arrayed into 
96-well plates, grown, and frozen for storage. Cells in duplicate plates are infected with helper phage, grown overnight 
and pelleted by centrifugation. The supernatants containing phage displaying particular variants are incubated with 
immobilized antibodies and bound clones are detected by anti-M13 antibody conjugates. Titration series of phage 
particles, immobilized antigen, and/or soluble antigen competition binding studies are all highly effective means to 
quantitate protein binding. Variant antigens displaying multiple epitopes will be further studied in appropriate animal 
challenge models. 

[0122] Several groups have reported an in vitro ribosome display system for the screening and selection of mutant 
proteins with desired properties from large libraries. This technique can be used similarly to phage display to select or 
enrich for variant antigens with improved properties such as broad cross reactivity to antibodies and improved folding 
(see, e.g., Hanes et al. (1997) Proc. Nat 7. Acad. Sci. USA 94(1 0):4937-42; Mattheakis et al. (1994) Proc. Nat 1 Acad. 
Sci. USA 91(19):9022-6; He et al. (1997) Nucl. Acids Res. 25(24):5132-4; Nemoto et al. (1997) FEBS Lett. 414(2): 
405-8). 

[0123] Other display methods exist to screen antigens for improved properties such as increased expression levels, 
broad cross reactivity, enhanced folding and stability. These include, but are not limited to display of proteins on intact 
E. coli or other cells (e.g., Francisco et al. (1993) Proc. Nat'l. Acad. Sci. USA 90: 1044-10448; Lu et al. (1995) Biol 
Technology 13: 366-372). Fusions of shuffled antigens to DNA-binding proteins can link the antigen protein to its gene 
in an expression vector (Schatz et al. (1996) Methods Enzymol. 267: 171-91; Gates et al. (1996) J. Mol. Biol. 255: 
373-86.) 

[0124] The various display methods and ELISA assays can be used to screen for shuffled antigens with improved 
properties such as presentation of multiple epitopes, improved immunogenicity, increased expression levels, increased 
folding rates and efficiency, increased stability to factors such as temperature, buffers, solvents, improved purification 
properties, etc. Selection of shuffled antigens with improved expression, folding, stability and purification profile under 
a variety of chromatographic conditions can be very important improvements to incorporate for the vaccine manufac- 
turing process. 

[0125] To identify recombinant antigenic polypeptides that exhibit improved expression in a host cell, flow cytometry 
is a useful technique. Flow cytometry provides a method to efficiently analyze the functional properties of millions of 
individual cells. One can analyze the expression levels of several genes simultaneously, and flow cytometry-based 
cell sorting allows for the selection of cells that display properly expressed antigen variants on the cell surface or in 
the cytoplasm. Very large numbers (>10 7 ) of cells can be evaluated in a single vial experiment, and the pool of the 
best individual sequences can be recovered from the sorted cells. These methods are particularly useful in the case 
of, for example, Hantaan virus glycoproteins, which are generally very poorly expressed in mammalian cells. This 
approach provides a general solution to improve expression levels of pathogen antigens in mammalian cells, a phe- 
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nomenon that is critical for the function of genetic vaccines. 

[0126] To use flow cytometry to analyze polypeptides that are not expressed on the cell surface, one can engineer 
the recombinant polynucleotides in the library such that the polynucleotide is expressed as a fusion protein that has a 
region of amino acids which is targeted to the cell membrane. For example, the region can encode a hydrophobic 
stretch of C-terminal amino acids which signals the attachment of a phosphoinositol-glycan (PIG) terminus on the 
expressed protein and directs the protein to be expressed on the surface of the transfected cell (Whitehorn etal. (1 995) 
Biotechnology (N Y) 13:1215-9). With an antigen that is naturally a soluble protein, this method will likely not affect the 
three dimensional folding of the protein in this engineered fusion with anew C-terminus. With an antigen that is naturally 
a transmembrane protein (e.g., a surface membrane protein on pathogenic viruses, bacteria, protozoa or tumor cells) 
there are at least two possibilities. First, the extracellular domain can be engineered to be in fusion with the C-terminal 
sequence for signaling PIG-linkage. Second, the protein can be expressed in toto relying on the signalling of the host 
cell to direct it efficiently to the cell surface. In a minority of cases, the antigen for expression will have an endogenous 
PIG terminal linkage (e.g., some antigens of pathogenic protozoa). 

[0127] Those cells expressing the antigen can be identified with a fluorescent monoclonal antibody specific for the 
C-terminal sequence on PIG-linked forms of the surface antigen. FACS analysis allows quantitative assessment of the 
level of expression of the correct form of the antigen on the cell population. Cells expressing the maximal level of 
antigen are sorted and standard molecular biology methods are used to recover the plasmid DNA vaccine vector that 
conferred this reactivity. An alternative procedure that allows purification of all those cells expressing the antigen (and 
that may be useful prior to loading onto a cell sorter since antigen expressing cells may be a very small minority 
population), is to rosette or pan-purify the cells expressing surface antigen. Rosettes can be formed between antigen 
expressing cells and erythrocytes bearing covalently coupled antibody to the relevant antigen. These are readily purified 
by unit gravity sedimentation. Panning of the cell population over petri dishes bearing immobilized monoclonal antibody 
specific for the relevant antigen can also be used to remove unwanted cells. 

[0128] In the high throughput assays of the invention, it is possible to screen up to several thousand different shuffled 
variants in a single day. For example, each well of a microtiter plate can be used to run a separate assay, or, if con- 
centration or incubation time effects are to be observed, every 5-10 wells can test a single variant. Thus, a single 
standard microtiter plate can assay about 1 00 (e.g., 96) reactions. If 1 536 well plates are used, then a single plate can 
easily assay from about 100 to about 1500 different reactions. It is possible to assay several different plates per day; 
assay screens for up to about 6,000-20,000 different assays (i.e., involving different nucleic acids, encoded proteins, 
concentrations, etc.) is possible using the integrated systems of the invention. More recently, microfluidic approaches 
to reagent manipulation have been developed, e.g., by Caliper Technologies (Palo Alto, CA). 

[0129] In one aspect, library members, e.g., cells, viral plaques, or the like, are separated on solid media to produce 
individual colonies (or plaques). Using an automated colony picker (e.g., the Q-bot, Genetix, U.K.), colonies or plaques 
are identified, picked, and up to 1 0,000 different mutants inoculated into 96 well microtiter dishes, optionally containing 
glass balls in the wells to prevent aggregation. The Q-bot does not pick an entire colony but rather inserts a pin through 
the center of the colony and exits with a small sampling of cells (or viruses in plaque applications). The time the pin is 
in the colony, the number of dips to inoculate the culture medium, and the time the pin is in that medium each effect 
inoculum size, and each can be controlled and optimized. The uniform process of the Q-bot decreases human handling 
error and increases the rate of establishing cultures (roughly 10,000/4 hours). These cultures are then shaken in a 
temperature and humidity controlled incubator. The glass balls in the microtiter plates act to promote uniform aeration 
of cells dispersal of cells, or the like, similar to the blades of a fermentor. Clones from cultures of interest can be cloned 
by limiting dilution. Plaques or cells constituting libraries can also be screened directly for production of proteins, either 
by detecting hybridization, protein activity, protein binding to antibodies, or the like. 

[0130] The ability to detect a subtle increase in the performance of a shuffled library member over that of a parent 
strain relies on the sensitivity of the assay. The chance of finding the organisms having an improvement in ability to 
induce an immune response is increased by the number of individual mutants that can be screened by the assay. To 
increase the chances of identifying a pool of sufficient size, a prescreen that increases the number of mutants processed 
by 10-fold can be used. The goal of the prescreen will be to quickly identify mutants having equal or better product 
titers than the parent strain(s) and to move only these mutants forward to liquid cell culture for subsequent analysis. 
[0131] A number of well known robotic systems have also been developed for solution phase chemistries useful in 
assay systems. These systems include automated workstations like the automated synthesis apparatus developed by 
Takeda Chemical Industries, LTD. (Osaka, Japan) and many robotic systems utilizing robotic arms (Zymate II, Zymark 
Corporation, Hopkinton, Mass.; Orca, Hewlett-Packard, Palo Alto, Calif.) which mimic the manual synthetic operations 
performed by a scientist. Any of the above devices are suitable for use with the present invention, e.g., for high-through- 
put screening of molecules encoded by codon-altered nucleic acids. The nature and implementation of modifications 
to these devices (if any) so that they can operate as discussed herein with reference to the integrated system will be 
apparent to persons skilled in the relevant art. 

[0132] High throughput screening systems are commercially available (see, e.g., Zymark Corp., Hopkinton, MA; Air 
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Technical Industries, Mentor, OH; Beckman Instruments, Inc. Fullerton, CA; Precision Systems, Inc., Natick, MA, etc.). 
These systems typically automate entire procedures including all sample and reagent pipetting, liquid dispensing, timed 
incubations, and final readings of the microplate in detector(s) appropriate for the assay. These configurable systems 
provide high throughput and rapid start up as well as a high degree of flexibility and customization. 
5 [0133] The manufacturers of such systems provide detailed protocols the various high throughput. Thus, for example, 
Zymark Corp. provides technical bulletins describing screening systems for detecting the modulation of gene transcrip- 
tion, ligand binding, and the like. Microfluidic approaches to reagent manipulation have also been developed, e.g., by 
Caliper Technologies (Palo Alto, CA). 

[0134] Optical images viewed (and, optionally, recorded) by a camera or other recording device (e.g., a photodiode 
10 and data storage device) are optionally further processed in any of the embodiments herein, e.g., by digitizing the 
image and/or storing and analyzing the image on acomputer. As noted above, in some applications, the signals resulting 
from assays are florescent, making optical detection approaches appropriate in these instances. A variety of commer- 
cially available peripheral equipment and software is available for digitizing, storing and analyzing a digitized video or 
digitized optical image, e.g., using PC (Intel x86 or Pentium chip- compatible DOS , OS2 WINDOWS , WINDOWS NT 
15 or WINDOWS95 based machines), MACINTOSH , or UNIX based (e.g., SUN work station) computers. 

[0135] One conventional system carries light from the assay device to a cooled charge-coupled device (CCD) cam- 
era, in common use in the art. A CCD camera includes an array of picture elements (pixels). The light from the specimen 
is imaged on the CCD. Particular pixels corresponding to regions of the specimen (e.g., individual hybridization sites 
on an array of biological polymers) are sampled to obtain light intensity readings for each position. Multiple pixels are 
20 processed in parallel to increase speed. The apparatus and methods of the invention are easily used for viewing any 
sample, e.g., by fluorescent or dark field microscopic techniques. 

[0136] Integrated systems for analysis in the present invention typically include a digital computer with high-through- 
put liquid control software, image analysis software, data interpretation software, a robotic liquid control armature for 
transferring solutions from a source to a destination operably linked to the digital computer, an input device (e.g., a 
25 computer keyboard) for entering data to the digital computer to control high throughput liquid transfer by the robotic 
liquid control armature and, optionally, an image scanner for digitizing label signals from labeled assay component. 
The image scanner interfaces with the image analysis software to provide a measurement of optical intensity. Typically, 
the intensity measurement is interpreted by the data interpretation software to show whether the optimized recombinant 
antigenic polypeptide products are produced. 

30 

2. Antigen Library Immunization 

[0137] In a presently preferred embodiment, antigen library immunization (ALI) is used to identify optimized recom- 
binant antigens that have improved immunogenicity. ALI involves introduction of the library of recombinant antigen- 

35 encoding nucleic acids, or the recombinant antigens encoded by the shuffled nucleic acids, into a test animal. The 
animals are then subjected to in vivo challenge using live pathogens. Neutralizing antibodies and cross-protective 
immune responses are studied after immunization with the entire libraries, pools and/or individual antigen variants. 
[0138] Methods of immunizing test animals are well known to those of skill in the art. In presently preferred embod- 
iments, test animals are immunized twice or three times at two week intervals. One week after the last immunization, 

40 the animals are challenged with live pathogens (or mixtures of pathogens), and the survival and symptoms of the 
animals is followed. Immunizations using test animal challenge are described in, for example, Roggenkamp et ai. 
(1997) Infect. Immun. 65: 446; Woody etal. (1997) Vaccine 2: 133; Agren etal. (1997) J. Immunol. 158: 3936; Konishi 
etal. (1992) Virology 190: 454; Kinney etal. (1988) J. Virol. 62: 4697; lacono-Connors etal. (1996) Virus Res. 43: 125; 
Kochel etal. (1997) Vaccine 15: 547; and Chu etal. (1995) J. Virol. 69: 6417. 

45 [0139] The immunizations can be performed by injecting either the recombinant polynucleotides themselves, i.e., as 
a genetic vaccine, or by immunizing the animals with polypeptides encoded by the recombinant polynucleotides. Bac- 
terial antigens are typically screened primarily as recombinant proteins, whereas viral antigens are preferably analyzed 
using genetic vaccinations. 

[0140] To dramatically reduce the number of experiments required to identify individual antigens having improved 
50 immunogenic properties, one can use pooling and deconvolution, as diagrammed in Figure 6. Pools of recombinant 
nucleic acids, or polypeptides encoded by the recombinant nucleic acids, are used to immunize test animals. Those 
pools that result in protection against pathogen challenge are then subdivided and subjected to additional analysis. 
The high throughput in vitro approaches described above can be used to identify the best candidate sequences for 
the in vivo studies. 

55 [0141] The challenge models that can be used to screen for protective antigens include pathogen and toxin models, 
such as Yersinia bacteria, bacterial toxins (such as Staphylococcal and Streptococcal en terotox ins, E.colilV. cholerae 
enterotoxins), Venezuelan equine encephalitis virus (VEE), Flaviviruses (Japanese encephalitis virus, Tick-borne en- 
cephalitis virus, Dengue virus), Hantaan virus, Herpes simplex, influenza virus (e.g., Influenza A virus), Vesicular Sto- 
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matit is Virus, Pseudomonas aeruginosa, Salmonella typhi murium, Escherichia coii, Klebsiella pneumoniae, Toxoplas- 
ma gondii, Plasmodium yoeiii, Herpes simplex, influenza virus (e.g., Influenza A virus), and Vesicular Stomatitis Virus. 
However, the test animals can also be challenged with tumor cells to enable screening of antigens that efficiently protect 
against malignancies. Individual shuffled antigens or pools of antigens are introduced into the animals intradermally, 
intramuscularly, intravenously, intratracheal^, anally, vaginally, orally, or intraperitoneal^ and antigens that can prevent 
the disease are chosen, when desired, for further rounds of shuffling and selection. Eventually, the most potent antigens, 
based on in vivo data in test animals and comparative in vitro studies in animals and man, are chosen for human trials, 
and their capacity to prevent and treat human diseases is investigated. 

[0142] In some embodiments, antigen library immunization and pooling of individual clones is used to immunize 
against a pathogen strain that was not included in the sequences that were used to generate the library. The level of 
crossprotection provided by different strains of a given pathogen can significantly. However, homologous titer is always 
higher than heterologous titer. Pooling and deconvolution is especially efficient in models where minimal protection is 
provided by the wild-type antigens used as starting material for shuffling (for example minimal protection by antigens 
A and B against strain C in Figure 3B). This approach can be taken, for example, when evolving the V-antigen of 
Yersinaeox Hantaan virus glycoproteins. 

[0143] In some embodiments, the desired screening involves analysis of the immune response based on immuno- 
logical assays known to those skilled in the art. Typically, the test animals are first immunized and blood or tissue 
samples are collected for example one to two weeks after the last immunization. These studies enable one to one can 
measure immune parameters that correlate to protective immunity, such as induction of specific antibodies (particularly 
IgG) and induction of specific T lymphocyte responses, in addition to determining whether an antigen or pools of an- 
tigens provides protective immunity. Spleen cells or peripheral blood mononuclear cells can be isolated from immunized 
test animals and measured for the presence of antigen-specific T cells and induction of cytokine synthesis. ELISA, 
ELISPOT and cytoplasmic cytokine staining, combined with flow cytometry, can provide such information on a single- 
cell level. 

[0144] Common immunological tests that can be used to identify the efficacy of immunization include antibody meas- 
urements, neutralization assays and analysis of activation levels or frequencies of antigen presenting cells or lym- 
phocytes that are specific for the antigen or pathogen. The test animals that can be used in such studies include, but 
are not limited to, mice, rats, guinea pigs, hamsters, rabbits, cats, dogs, pigs and monkeys. Monkey is a particularly 
useful test animal because the MHC molecules of monkeys and humans are very similar. 

[0145] Virus neutralization assays are useful for detection of antibodies that not only specifically bind to the pathogen, 
but also neutralize the function of the virus. These assays are typically based on detection of antibodies in the sera of 
immunized animal and analysis of these antibodies for their capacity to inhibit viral growth in tissue culture cells. Such 
assays are known to those skilled in the art. One example of a virus neutralization assay is described by Dolin R (J. 
Infect. Dis. 1995, 172:1175-83). Virus neutralization assays provide means to screen for antigens that also provide 
protective immunity. 

[0146] In some embodiments, shuffled antigens are screened for their capacity to induce T cell activation in vivo. 
More specifically, peripheral blood mononuclear cells or spleen cells from injected mice can be isolated and the capacity 
of cytotoxic T lymphocytes to lyse infected, autologous target cells is studied. The spleen cells can be reactivated with 
the specific antigen in vitro. In addition, T helper cell activation and differentiation is analyzed by measuring cell pro- 
liferation or production of T H 1 (IL-2 and IFN-y) and T H 2 (IL-4 and IL-5) cytokines by ELISA and directly in CD4+ T cells 
by cytoplasmic cytokine staining and flow cytometry. Based on the cytokine production profile, one can also screen for 
alterations in the capacity of the antigens to direct T H 1/T H 2 differentiation (as evidenced, for example, by changes in 
ratios of IL-4/IFN-y, IL-4/IL-2, IL-5/IFN-y, IL-5/IL-2, IL-13/IFN-y, IL-1 3/IL-2). The analysis of the T cell activation induced 
by the antigen variants is a very useful screening method, because potent activation of specific T cells in vivo correlates 
to induction of protective immunity. 

[0147] The frequency of antigen-specific CD8+ T cells in vivo can also be directly analyzed using tetramers of MHC 
class I molecules expressing specific peptides derived from the corresponding pathogen antigens (Ogg and McMichael, 
Curr. Opin. Immunol. 1998, 10:393-6; Altman et ai, Science 1996, 274:94-6). The binding of the tetramers can be 
detected using flow cytometry, and will provide information about the efficacy of the shuffled antigens to induce acti- 
vation of specific T cells. For example, flow cytometry and tetramer stainings provide an efficient method of identifying 
T cells that are specific to a given antigen or peptide. Another method involves panning using plates coated with 
tetramers with the specific peptides. This method allows large numbers of cells to be handled in a short time, but the 
method only selects for highest expression levels. The higher the frequency of antigen-specific T cells in vivo is, the 
more efficient the immunization has been, enabling identification of the antigen variants that have the most potent 
capacity to induce protective immune responses. These studies are particularly useful when conducted in monkeys, 
or other primates, because the MHC class I molecules of humans mimic those of other primates more closely than 
those of mice. 

[0148] Measurement of the activation of antigen presenting cells (APC) in response to immunization by antigen 
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variants is another useful screening method. Induction of APC activation can be detected based on changes in surface 
expression levels of activation antigens, such as B7-1 (CD80), B7-2 (CD86), MHC class I and II, CD14, CD23, and Fc 
receptors, and the like. 

[01 49] Shuffled cancer antigens that induce cytotoxic T cells that have the capacity to kill cancer cells can be identified 
5 by measuring the capacity of T cells derived from immunized animals to kill cancer cells in vitro. Typically the cancer 
cells are first labeled with radioactive isotopes and the release of radioactivity is an indication of tumor cell killing after 
incubation in the presence of T cells from immunized animals. Such cytotoxicity assays are known in the art. 
[01 50] An indication of the efficacy of an antigen to activate T cells specific for, for example, cancer antigens, allergens 
or autoantigens, is also the degree of skin inflammation when the antigen is injected into the skin of a patient or test 
10 animal. Strong inflammation is correlated with strong activation of antigen-specific T cells. Improved activation of tumor- 
specific T cells may lead to enhanced killing of the tumors. In case of autoantigens, one can add immunomodulators 
that skew the responses towards T H 2, whereas in the case of allergens a T H 1 response is desired. Skin biopsies can 
be taken, enabling detailed studies of the type of immune response that occurs at the sites of each injection (in mice 
and monkeys large numbers of injections/antigens can be analyzed). Such studies include detection of changes in 
15 expression of cytokines, chemokines, accessory molecules, and the like, by cells upon injection of the antigen into the 
skin. 

[0151] To screen for antigens that have optimal capacity to activate antigen-specific T cells, peripheral blood mono- 
nuclear cells from previously infected or immunized humans individuals can be used. This is a particularly useful meth- 
od, because the MHC molecules that will present the antigenic peptides are human MHC molecules. Peripheral blood 

20 mononuclear cells or purified professional antigen-presenting cells (APCs) can be isolated from previously vaccinated 
or infected individuals or from patients with acute infection with the pathogen of interest. Because these individuals 
have increased frequencies of pathogen-specific T cells in circulation, antigens expressed in PBMCs or purified APCs 
of these individuals will induce proliferation and cytokine production by antigen-specific CD4+ and CD8+ T cells. Thus, 
antigens that simultaneously harbor epitopes from several antigens can be recognized by their capacity to stimulate 

25 T cells from various patients infected or immunized with different pathogen antigens, cancer antigens, autoantigens 
or allergens. One buffy coat derived from a blood donor contains lymphocytes from 0.5 liters of blood, and up to 1 0 4 
PBMC can be obtained, enabling very large screening experiments using T cells from one donor. 
[0152] When healthy vaccinated individuals (lab volunteers) are studied, one can make EBV-transformed B cell lines 
from these individuals. These cell lines can be used as antigen presenting cells in subsequent experiments using blood 

30 from the same donor; this reduces interassay and donor-to-donor variation. In addition, one can make antigen-specific 
T cell clones, after which antigen variants are introduced to EBV transformed B cells. The efficiency with which the 
transformed B cells induce proliferation of the specific T cell clones is then studied. When working with specific T cell 
clones, the proliferation and cytokine synthesis responses are significantly higher than when using total PBMCs, be- 
cause the frequency of antigen-specific T cells among PBMC is very low. 

35 [0153] CTL epitopes can be presented by most cells types since the class I major histocompatibility complex (MHC) 
surface glycoproteins are widely expressed. Therefore, transfection of cells in culture by libraries of shuffled antigen 
sequences in appropriate expression vectors can lead to class I epitope presentation. If specific CTLs directed to a 
given epitope have been isolated from an individual, then the co-culture of the transfected presenting cells and the 
CTLs can lead to release by the CTLs of cytokines, such as IL-2, IFN-y, or TNF, if the epitope is presented. Higher 

40 amounts of released TNF will correspond to more efficient processing and presentation of the class I epitope from the 
shuffled, evolved sequence. Shuffled antigens that induce cytotoxic T cells that have the capacity to kill infected cells 
can also be identified by measuring the capacity of T cells derived from immunized animals to kill infected cells in vitro. 
Typically the target cells are first labeled with radioactive isotopes and the release of radioactivity is an indication of 
target cell killing after incubation in the presence of T cells from immunized animals. Such cytotoxicity assays are 

45 known in the art. 

[0154] A second method for identifying optimized CTL epitopes does not require the isolation of CTLs reacting with 
the epitope. In this approach, cells expressing class I MHC surface glycoproteins are transfected with the library of 
evolved sequences as above. After suitable incubation to allow for processing and presentation, a detergent soluble 
extract is prepared from each cell culture and after a partial purification of the MHC-epitope complex (perhaps optional) 
50 the products are submitted to mass spectrometry (Henderson et al. (1993) Proc, Nat'l. Acad. Sci. USA 90: 
1 0275-1 0279). Since the sequence is known of the epitope whose presentation to be increased, one can calibrate the 
mass spectrogram to identify this peptide. In addition, a cellular protein can be used for internal calibration to obtain a 
quantitative result; the cellular protein used for internal calibration could be the MHC molecule itself. Thus one can 
measure the amount of peptide epitope bound as a proportion of the MHC molecules. 

55 

Use of Recombinant Multivalent Antigens 

[0155] The multivalent antigens of the invention are useful for treating and/or preventing the various diseases and 
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conditions with which the respective antigens are associated. For example, the multivalent antigens can be expressed 
in a suitable host cell and are administered in polypeptide form. Suitable formulations and dosage regimes for vaccine 
delivery are well known to those of skill in the art. 

[0156] In presently preferred embodiments, the optimized recombinant polynucleotides that encode improved aller- 
5 gens are used in conjunction with a genetic vaccine vector. The choice of vector and components can also be optimized 
for the particular purpose of treating allergy. For example, the polynucleotide that encodes the recombinant antigenic 
polypeptide can be placed under the control of a promoter, e.g., a high activity or tissue-specific promoter. The promoter 
used to express the antigenic polypeptide can itself be optimized using recombination and selection methods analogous 
to those described herein. The vector can contain immunostimulatory sequences such as are described in copending, 

10 commonly assigned US Patent Application Serial No. , entitled "Optimization of Immunomodulatory 

Molecules," filed as TTC Attorney Docket No. 1 8097-030300US on February 10, 1999. A vector engineered to direct 
a T H 1 response is preferred for many of the immune responses mediated by the antigens described herein (see, e.g., 

copending, commonly assigned US Patent Application Serial No. , entitled "Genetic Vaccine Vector 

Engineering," filed on February 1 0, 1 999 as TTC Attorney Docket No. 1 8097-0301 00US). It is sometimes advantageous 
15 to employ a genetic vaccine that is targeted for a particular target cell type (e.g., an antigen presenting cell or an antigen 
processing cell); suitable targeting methods are described in copending, commonly assigned US patent application 

Serial No. , entitled "Targeting of Genetic Vaccine Vectors," filed on February 10, 1999 as TTC 

Attorney Docket No. 1 8097-030200US. 

[0157] Genetic vaccines that encode the multivalent antigens described herein can be delivered to a mammal (in- 
20 eluding humans) to induce a therapeutic or prophylactic immune response. Vaccine delivery vehicles can be delivered 
in vivo by administration to an individual patient, typically by systemic administration (e.g., intravenous, intraperitoneal, 
intramuscular, subdermal, intracranial, anal, vaginal, oral, buccal route or they can be inhaled) or they can be admin- 
istered by topical application. Alternatively, vectors can be delivered to cells ex vivo, such as cells explanted from an 
individual patient (e.g., lymphocytes, bone marrow aspirates, tissue biopsy) or universal donor hematopoietic stem 
25 cells, followed by reimplantation of the cells into a patient, usually after selection for cells which have incorporated the 
vector. 

[0158] A large number of delivery methods are well known to those of skill in the art. Such methods include, for 
example liposome-based gene delivery (Debs and Zhu (1993) WO 93/24640; Mannino and Gould-Fogerite (1988) 
BioTechniques 6(7): 682-691; Rose U.S. Pat No. 5,279,833; Brigham (1991) WO 91/06309; and Feigner etai. (1987) 

30 Proc. Nati. Acad. Sci. USA 84: 7413-7414), as well as use of viral vectors (e.g., adenoviral (see, e.g., Berns et ai. 
(1995) Ann. NY Acad Sci. 772: 95-104; Ali et ai. (1994) Gene Ther. 1 : 367-384; and Haddada etai. (1995) Curr. Top. 
Microbiol. Immunol. 199 ( Pt 3): 297-306 for review), papillomaviral, retroviral (see, e.g., Buchscher etai. (1992) J. 
Virol. 66(5) 2731-2739; Johann etai. (1992) J. Virol. 66 (5):1 635-1 640 (1992); Sommerfelt etai., (1990) Virol. 176: 
58-59; Wilson etai. (1989) J. Virol. 63:2374-2378; Miller et al., J. Virol. 65:2220-2224 (1991); Wong-Staal etai., PCT/ 

35 US94/05700, and Rosenburg and Fauci (1993) in Fundamental Immunology, Third Edition Paul (ed) Raven Press, 
Ltd., New York and the references therein, and Yu et al., Gene Therapy (1994) supra.), and adeno-associated viral 
vectors (see, West etai. (1987) Virology 1 60:38-47; Carter etai. (1989) U.S. Patent No. 4,797,368; Carter etai. WO 
93/24641 (1993); Kotin (1994) Human Gene Therapy 5:793-801 ; Muzyczka (1 994) J. Clin. Invst. 94:1351 and Samulski 
(supra) for an overview of AAV vectors; see also, Lebkowski, U.S. Pat. No. 5,173,414; Tratschin et al. (1985) Mol. Cell. 

40 Biol. 5(1 1):3251 -3260; Tratschin, et al. (1984) Mol. Cell. Biol., 4:2072-2081; Hermonat and Muzyczka (1984) Proc. 
Natl. Acad. Sci. USA, 81:6466-6470; McLaughlin et al. (198S) and Samulski et al. (1989) J. Virol., 63:03822-3828), 
and the like. 

[0159] "Naked" DNA and/or RNA that comprises a genetic vaccine can be introduced directly into a tissue, such as 
muscle. See, e.g., USPN 5,580,859. Other methods such as "biolistic" or particle-mediated transformation (see, e.g., 

45 Sanford etai., USPN 4,945,050; USPN 5,036,006) are also suitable for introduction of genetic vaccines into cells of a 
mammal according to the invention. These methods are useful not only for in vivo introduction of DNA into a mammal, 
but also for ex vivo modification of cells for reintroduction into a mammal. As for other methods of delivering genetic 
vaccines, if necessary, vaccine administration is repeated in order to maintain the desired level of immunomodulation. 
[0160] Genetic vaccine vectors (e.g., adenoviruses, liposomes, papillomaviruses, retroviruses, etc.) can be admin- 

50 istered directly to the mammal for transduction of cells in vivo. The genetic vaccines obtained using the methods of 
the invention can be formulated as pharmaceutical compositions for administration in any suitable manner, including 
parenteral (e.g., subcutaneous, intramuscular, intradermal, or intravenous), topical, oral, rectal, intrathecal, buccal (e. 
g., sublingual), or local administration, such as by aerosol or transdermally, for prophylactic and/or therapeutic treat- 
ment. Pretreatment of skin, for example, by use of hair-removing agents, may be useful in transdermal delivery. Suitable 

55 methods of administering such packaged nucleic acids are available and well known to those of skill in the art, and, 
although more than one route can be used to administer a particular composition, a particular route can often provide 
a more immediate and more effective reaction than another route. 

[0161 ] Pharmaceutical^ acceptable carriers are determined in part by the particular composition being administered, 
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as well as by the particular method used to administer the composition. Accordingly, there is a wide variety of suitable 
formulations of pharmaceutical compositions of the present invention. A variety of aqueous carriers can be used, e.g., 
buffered saline and the like. These solutions are sterile and generally free of undesirable matter. These compositions 
may be sterilized by conventional, well known sterilization techniques. The compositions may contain pharmaceutical^ 
acceptable auxiliary substances as required to approximate physiological conditions such as pH adjusting and buffering 
agents, toxicity adjusting agents and the like, for example, sodium acetate, sodium chloride, potassium chloride, cal- 
cium chloride, sodium lactate and the like. The concentration of genetic vaccine vector in these formulations can vary 
widely, and will be selected primarily based on fluid volumes, viscosities, body weight and the like in accordance with 
the particular mode of administration selected and the patient's needs. 

[0162] Formulations suitable for oral administration can consist of (a) liquid solutions, such as an effective amount 
of the packaged nucleic acid suspended in diluents, such as water, saline or PEG 400; (b) capsules, sachets or tablets, 
each containing a predetermined amount of the active ingredient, as liquids, solids, granules or gelatin; (c) suspensions 
in an appropriate liquid; and (d) suitable emulsions. Tablet forms can include one or more of lactose, sucrose, mannitol, 
sorbitol, calcium phosphates, corn starch, potato starch, tragacanth, microcrystalline cellulose, acacia, gelatin, colloidal 
silicon dioxide, croscarmellose sodium, talc, magnesium stearate, stearic acid, and other excipients, colorants, fillers, 
binders, diluents, buffering agents, moistening agents, preservatives, flavoring agents, dyes, disintegrating agents, 
and pharmaceutical^ compatible carriers. Lozenge forms can comprise the active ingredient in a flavor, usually sucrose 
and acacia or tragacanth, as well as pastilles comprising the active ingredient in an inert base, such as gelatin and 
glycerin or sucrose and acacia emulsions, gels, and the like containing, in addition to the active ingredient, carriers 
known in the art. It is recognized that the genetic vaccines, when administered orally, must be protected from digestion. 
This is typically accomplished either by complexing the vaccine vector with a composition to render it resistant to acidic 
and enzymatic hydrolysis or by packaging the vector in an appropriately resistant carrier such as a liposome. Means 
of protecting vectors from digestion are well known in the art. The pharmaceutical compositions can be encapsulated, 
e.g., in liposomes, or in a formulation that provides for slow release of the active ingredient. 

[0163] The packaged nucleic acids, alone or in combination with other suitable components, can be made into aerosol 
formulations (e.g., they can be "nebulized") to be administered via inhalation. Aerosol formulations can be placed into 
pressurized acceptable propellants, such as dichlorodifluoromethane, propane, nitrogen, and the like. 
[0164] Suitable formulations for rectal administration include, for example, suppositories, which consist of the pack- 
aged nucleic acid with a suppository base. Suitable suppository bases include natural or synthetic triglycerides or 
paraffin hydrocarbons. In addition, it is also possible to use gelatin rectal capsules which consist of a combination of 
the packaged nucleic acid with a base, including, for example, liquid triglycerides, polyethylene glycols, and paraffin 
hydrocarbons. 

[0165] Formulations suitable for parenteral administration, such as, for example, by intraarticular (in the joints), in- 
travenous, intramuscular, intradermal, intraperitoneal, and subcutaneous routes, include aqueous and non-aqueous, 
isotonic sterile injection solutions, which can contain antioxidants, buffers, bacteriostats, and solutes that render the 
formulation isotonic with the blood of the intended recipient, and aqueous and non-aqueous sterile suspensions that 
can include suspending agents, solubilizers, thickening agents, stabilizers, and preservatives. In the practice of this 
invention, compositions can be administered, for example, by intravenous infusion, orally, topically, intraperitoneal^, 
intravesically or intrathecal ly. Parenteral administration and intravenous administration are the preferred methods of 
administration. The formulations of packaged nucleic acid can be presented in unit-dose or multi-dose sealed contain- 
ers, such as ampoules and vials. 

[0166] Injection solutions and suspensions can be prepared from sterile powders, granules, and tablets of the kind 
previously described. Cells transduced by the packaged nucleic acid can also be administered intravenously or 
parenterally. 

[0167] The dose administered to a patient, in the context of the present invention should be sufficient to effect a 
beneficial therapeutic response in the patient over time. The dose will be determined by the efficacy of the particular 
vector employed and the condition of the patient, as well as the body weight or vascular surface area of the patient to 
be treated. The size of the dose also will be determined by the existence, nature, and extent of any adverse side- 
effects that accompany the administration of a particular vector, or transduced cell type in a particular patient. 
[0168] In determining the effective amount of the vector to be administered in the treatment or prophylaxis of an 
infection or other condition, the physician evaluates vector toxicities, progression of the disease, and the production 
of anti-vector antibodies, if any. In general, the dose equivalent of a naked nucleic acid from a vector is from about 1 
jug to 1 mg for a typical 70 kilogram patient, and doses of vectors used to deliver the nucleic acid are calculated to 
yield an equivalent amount of therapeutic nucleic acid. Administration can be accomplished via single or divided doses. 
[0169] In therapeutic applications, compositions are administered to a patient suffering from a disease {e.g., an 
infectious disease or autoimmune disorder) in an amount sufficient to cure or at least partially arrest the disease and 
its complications. An amount adequate to accomplish this is defined as a "therapeutically effective dose." Amounts 
effective for this use will depend upon the severity of the disease and the general state of the patient's health. Single 
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or multiple administrations of the compositions may be administered depending on the dosage and frequency as re- 
quired and tolerated by the patient. In any event, the composition should provide a sufficient quantity of the proteins 
of this invention to effectively treat the patient. 

[0170] In prophylactic applications, compositions are administered to a human or other mammal to induce an immune 
response that can help protect against the establishment of an infectious disease or other condition. 
[0171] The toxicity and therapeutic efficacy of the genetic vaccine vectors provided by the invention are determined 
using standard pharmaceutical procedures in cell cultures or experimental animals. One can determine the LD 50 (the 
dose lethal to 50% of the population) and the ED 50 (the dose therapeutically effective in 50% of the population) using 
procedures presented herein and those otherwise known to those of skill in the art. 

[0172] A typical pharmaceutical composition for intravenous administration would be about 0.1 to 10 mg per patient 
per day. Dosages from 0.1 up to about 1 00 mg per patient per day may be used, particularly when the drug is admin- 
istered to a secluded site and not into the blood stream, such as into a body cavity or into a lumen of an organ. 
Substantially higher dosages are possible in topical administration. Actual methods for preparing parenterally admin- 
istrable compositions will be known or apparent to those skilled in the art and are described in more detail in such 
publications as Remington's Pharmaceutical Science, 15th ed., Mack Publishing Company, Easton, Pennsylvania 
(1980). 

[0173] The multivalent antigenic polypeptides of the invention, and genetic vaccines that express the polypeptides, 
can be packaged in packs, dispenser devices, and kits for administering genetic vaccines to a mammal. For example, 
packs or dispenser devices that contain one or more unit dosage forms are provided. Typically, instructions for admin- 
istration of the compounds will be provided with the packaging, along with a suitable indication on the label that the 
compound is suitable for treatment of an indicated condition. For example, the label may state that the active compound 
within the packaging is useful for treating a particular infectious disease, autoimmune disorder, tumor, or for preventing 
or treating other diseases or conditions that are mediated by, or potentially susceptible to, a mammalian immune re- 
sponse. 

EXAMPLES 

[0174] The following examples are offered to illustrate, but not to limit the present invention. 
Example 1 

Development Of Broad-Spectrum Vaccines Against Bacterial Pathogens And Toxins 
A. Evolution of Yersinia V-antigens 

[01 75] This Example describes the use of DN A shuffling to develop immunogens that produce strong cross-protective 
immune responses against a variety of Yersinia strains. Passive immunization with anti-V-antigen antibodies or active 
immunization with purified V-antigen can provide protection from challenge with a virulent autologous Yersinia species. 
However, protection against heterologous species is limited (Motin et al, (1994) Infect, Immun, 62: 4192). 
[0176] V-antigen genes from a variety of Yersinia strains, including serotypes of Y pestis, Y enterocoiitica, and Y 
pseudotuberculosis are subjected to DN A shuffling as described herein. The Yersinia pestis V antigen coding sequence, 
for example, is used as a query in a database search to identify homologous genes that can be used in a family shuffling 
format to obtain improved antigens. Results for a BLAST search of GenBank and EMBL databases are shown in Table 
1 , in which each line represents a unique sequence entry listing the database, accession number, locus name, bit score 
and E value. See, Altschul et al, (1997) Nucleic Acids Res. 25:3389-3402, for a description of the search algorithm). 
Homologous antigens have been cloned and sequenced from a number of related yet distinct Yersinia strains and 
additional natural diversity is obtained by cloning antigen genes from other strains. These genes and others or frag- 
ments thereof are cloned by methods such as PCR, shuffled and screened for improved antigens. 



Table 1 



Sequences producing significant alignments 






Database/Accession No. 


Gene 


Score 
(bits) 


E Value 


gb|M26405|YEPLCR 


Yersinia pestis 1crG, 1crV, and 1crH genes, co 


1945 


0.0 


gb|AF053946|AF053946 


Yersinia pestis plasmid pCD1 , complete pla 


1945 


0.0 
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Table 1 (continued) 



Sequences producing significant alignments 






Database/Accession No. 


Gene 


Score 
(bits) 


E Value 


emb|X96802|YPTPIVANT 


Y. pseudotuberculosis V antigen gene 


1834 


0.0 


gb|M57893|YEPLCRGVHP 


Yersinia pseudotuberculosis V-antigen 


1818 


0.0 


gb|AF080155|AF080155 


Yersinia enterocolltlca pYV LcrV (IcrV) antigen 


1723 


0.0 


emb|X96801 |YE96PVANT 


Y enterocolltlca V antigen gene, strain Y-... 


1667 


0.0 


emb|X96799|YE1 08VANT 


Y enterocolltlca V antigen gene, strain Y-... 


1659 


0.0 


emb|X96800|YE527VANT 


Y enterocolltlca V antigen gene, serotype ... 


1651 


0.0 


emb|X96798|YE808VANT 


Y enterocolltlca V antigen gene, strain 8081 


1643 


0.0 


emb|X96796|YE314VANT 


Y. enterocolltlca V antigen gene, strain WA. 


1237 


0.0 


emb|X96797|YENCTVANT 


Y enterocolltlca V antigen gene, strain NCTC 


1221 


0.0 


gb|S38727|S38727 


IcrGVH operon: lcrV=V-antigen [Yersinia pseudo.] 


365 


9e-99 



[0177] Shuffled clones are selected by phage display and/or screened by ELISA to identify those recombinant nucleic 
acids that encode polypeptides that have multiple epitopes corresponding to the different serotypes. The shuffled 
antigen genes are cloned into a filamentous phage genome for polyvalent phage display or a suitable phagemid vector 
for monovalent phage display. A typical protocol for panning antigens by phage display is as follows. 

Coat an appropriate surface (e.g., Nunc Maxisorp tube or multiwell plate) at 4°C overnight with the target antibody, 
usually at a concentration of 1 -1 0 |ug/ml in PBS or other suitable buffer 

Rinse and Block with PBSM (PBS + 3% nonfat dry milk) at 37°C for 1 -2 hr 

Pre-block phage if needed (PBSM, RT 1 hr) 

Rinse tube and allow phage to bind (usually 1 hr (g> 37°C) 

Can vary time, temp, buffer, add a competitive inhibitor, etc. 

• Wash extensively (15x) with PBST (PBS + 0.1% TWEEN20), then PBS 

Elute bound phage with low pH {e.g., 10 mM glycine), 100 mM triethylamine, competitive ligand, protease, etc. 
and then neutralize pH if needed. 

Infect E. coll with eluted phage to transduce expression phagemid into new host. Titer and plate for colonies on 
drug plates 

Pool colonies into media, grow cells and infect with helper phage to produce phage for next round 

[0178] Phage ELISA assays are a useful method to rapidly evaluate single clones after panning of libraries. Single 
colonies are picked in individual wells of a multiwell plate containing 2YT media and grown as a master plate. A replicate 
plate is infected with helper phage and grown so that phage from a single well will display a single antigen variant. A 
suitable protocol for phage ELISA assays is as follows. 

Coat microtiter plate with 50 jllI of 1 |wg/ml target antibody 4 °C overnight 

Rinse and block with PBSM for 2 hrs @ 37 °C 

Rinse, add preblocked phage and allow to bind 1 hr @ 37 °C 
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Wash plates with PBST 3x, then PBS 3x with 2 min soaks 

Add HRP(or AP)-conjugated anti-M13 antibodies for 1 hr @ 37 °C 

Add substrate and measure absorbance 

Identify positive clones for further evaluation 

[0179] ELISA assays can also be used to screen for individual antigens with multiple epitopes or increased expression 
levels. Single colonies are picked in individual wells of a multiwell plate containing appropriate media and grown as a 
master plate so that antigens produced from a single well are a single antigen variant. A replicate plate is grown and 
induced for protein production, e.g., by addition of 0.5 mM IPTG for Lac repressor-based systems and grown for an 
appropriate time for the antigen to be produced. At this point a crude antigen preparation is made which depends on 
the antigen and where it is produced. Secreted proteins can be evaluated by assaying the cell supernatants after 
centrifugation. Periplasmic proteins are often readily released from cells by simple extraction into hyper- or hypo-tonic 
buffers. Intracellular^ produced proteins will require some form of cell lysis such as detergent treatment to release 
them. A suitable protocol for ELISA assays is as follows. 

Coat microtiter plate with 50 jllI of 1 jig/ml target antibody 4 °C overnight 

Rinse and block with PBSM for 2 hrs @ 37 °C 

Rinse, add antigen prep and allow to bind 1 hr @ 37 °C 

Wash plates with PBST 3x 

Add HRP(or AP)-conjugated secondary antibody and incubate for 1 hr @ 37°C 

Add appropriate substrate and measure absorbance 

Identify positive clones for further evaluation 

[0180] Antibodies specific for many of the various antigens are commercially available (e.g., Toxin Technology, Inc, 
Sarasota, FL) or can be generated by immunizing suitable animals with purified antigens. Protein A or Protein G Sepha- 
rose (Pharmacia) can be used to purify immunoglobulins from the serum. Various affinity purification schemes can be 
used to further purify family-specific antibodies if needed such as immobilization of specific antigens to NHS-, CNBr-, 
or epoxy-activated sepharose beads. Other related antigens may be included soluble form to prevent binding and 
immobilization of cross-reactive antibodies. 

[0181] The multivalent polypeptides that are identified by the initial screening protocol are purified and subjected to 
in vivo screening. For example, the shuffled antigens selected by a combination of any or none of these methods are 
purified and used to immunize animals, initially mice, which are then evaluated for improved immune responses. Typ- 
ically 10 micrograms of protein is injected to a suitable location with or without appropriate adjuvant, e.g., Alhydrogel 
(EM Seargent Pulp and Chemical, Inc.) and the animals are boosted with an additional dose after 2-4 weeks. At this 
point serum samples is drawn and evaluated by ELISA assay for the presence of antibodies that cross-react against 
multiple parental antigens. In this ELISA assay format the antigens are coated onto multiwell plates, then serial dilutions 
of each sera is allowed to bind. After washing unbound antibodies, a secondary HRP- or AP- conjugated antibody 
directed against the appropriate test antibody constant region, e.g., goat anti-mouse IgG Fc (Sigma) is bound. After 
another washing, the appropriate substrate is added, e.g., O-phenylenediamine (Sigma). The absorbance of each well 
is read by a plate reader at the appropriate wavelength (e.g., 490 nm for OPD) and those producing high antibody 
titers to multiple antigens are selected for further evaluation. 

[0182] Additionally, the ability of antigens to generate neutralizing antibodies can be evaluated in an appropriate 
system. Antigen variants that elicit a broad cross-reactive response are evaluated further in a virulent challenge model 
with the appropriate pathogenic organism. For example, the multivalent polypeptides are used to immunize mice, which 
are then challenged with live Yersinia bacteria. Those multivalent polypeptides that protect against the challenge are 
identified and purified. 
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[0183] This Example describes the use of DNA shuffling to obtain multivalent polypeptides that are effective in in- 
ducing an immune response against a broad spectrum of bacterial toxins. 

1. Staphylococcus 

[0184] The Group A Streptococci, which can cause diseases such as food poisoning, toxic shock syndrome, and 
autoimmune disorders, are highly toxic by inhalation. The family of Group A Streptococcus toxins numbers about 30 
related members, making this group a suitable target for family shuffling. Accordingly, this Example describes the use 
of family DNA shuffling to create chimeric proteins that are capable of eliciting broad spectrum protection. 
[0185] Nucleic acids that encode many diverse attenuated toxins are subjected to DNA shuffling as described herein. 
Table 2 shows the output of a BLAST search of GenBank, PDL, EMBL, and Swissprot using the S, aureus enterotoxin 
B protein to identify homologous genes that may be used in a family shuffling format to obtain improved antigens. 



Table 2 



Sequences producing significant alignments 






Database/Accession No. 


Gene 


Score 
(bits) 


E 

Value 


sp | P0 1552 | KTXB -STAAU 


"ENTEROTOXIN TYPE U PRECURSOR (SEE J >c 


554 


e-157 


pdb|l£iE3| 


staphylococcal Enterotoxin ±j compiexc 
Tri . . . 


'504 


'e-142 


pdb|lSEB'|D 


stapnyiococcus aureus >gi } ib3334B jpot 
Staphyl . . . 


"405 


e-I13 


sp 


F23313 STAAU 


ENTEROTOXIN T¥PB 0-3 PRECURSOR (SEC3; 


'376 


e-103 


sp 


POlSbJ |ETC1JSTAAU '" 


ENTEROTOXIN TXJPJE <J-1 FKEtJURSOR (SECli 


368 


e-101 


sp 


F34U7l)£TU2 STAAU 


ENTEROTOXIN 'VYPE C-2 PRECURSOR" (EEC21 


361 


' 2e-99 


gi 


2S>514S 1 - 


enterotoxin istapnylo coccus 


338' 








tJ-iiJ^7aj enterotoxin istapnyiococcus 


332 


le-90 




'29S143 


<iii337 5] enterotoxin istapftyiococcus 


330 


4e-yo 


gi 




(Xj13 37 8J enterotoxin istapnyiococcus 


329 


Ie-B9 




cnain fl, i'-uexj. Receptor seta L'nam t. 
With S. . . 


328 


2e-B9 1 


-gTpr95141' 


liti3374 ) enterotoxin istapnyiococcus 


325 


3e-89 
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Sequences producing significant alignments 






Database/Accession No. 


Gene 


Score 


E 

Value 


pOD | 15E2 1 


staphylococcal isnterotoxm c2 , Monoc. 
ante . . . 




4e-B9 


gi|19060S2 


iuyi526j type c encerotoxin istiapnyic 

A ^ a a ^3 

liiteirTueQ.. . . 




~5e-B9 


gi|29bl47 1 


iLU33/7} enterotoxm l staphylococcus 


323 


/e-BB 


DDS | 155101 


enterotoxin^pyrogemc toxin [stapnyic 
4446, P. > . 


319 


le-85" 


gi | 4 / b /o4 


(ii2956b} superantigen (.streptococcus 


ill' 


3e-d4 


gij 1245172 


(U4B792J superantigen ssa istreptococ 
pyogene s j > . . . 


31U 


4e-84 


gi| 1245174 


(U4B793J superantigen ssa tstreptococ 
pyogenes j 


3oy 


ie-B3 


sp | P0809S | SPEA__STRPY 


EXOTOXIN TVPE A PRECURSOR (SCARLET FJ 




2e-5B 


gi | 4 /2b» 


^A6i560j type A exocoxin istreptococ( 
pyogenes] >gi| • - * 


211" 


ie-i>4 


pir ) ) bio / b J 


exotoxin type A precursor (aiieie 3j 
□crepLOCoccu . * . 


211 


4e-b4 


pir||Sia-7B6 

"wtv r t TTUTins — "1 


exotoxin type A precursor ( allele D 
Dtrepcocuccu - . . 


209 


2e-53 




exotoxin A precursor (alleXe 4j - st2 
py°» • * 


206 


8e-53 


gi|47328 


(A61554J type A exotoxin istreptococc 


iy6 


ye-5u 


pj>4 | | rtZ. O X 3 £t 


ancpLyLuuwai y y i. uy cj.ij-w cjiuuu.jiJLi4 uypt 


185 


2e-4b 


sp j P2U723" | ETXD STAAU 


ENTEROTOXIN T*PE D PRECURSOR (SED) >^ 


131 


3e-^o 






129 


l2e- 5 9 






12o 


3e**29 




o ua^Jiiy i,utULL etui tr u a *>yj. j ,1 o .j ,3 ^ j .j I pux 

Staphyl. . . 


125 


2e-28 






125 


2e-2b 






118 


3e-2& 


gi|biU692 


IUH/U2J enterotoxm u Lataptiyiococci 
>qi 1 10 ♦ . . 


98 


7e-20 


gi | 149047 


IM94S72) enterotoxin D LPlasmid plB4J 


89 


2e-i7 


gi|2689b53 


tU9368B) enterotoxin tatapnylococcus 


75 


2e-U 


gij 15378b 


IN97156J pyrogen! c exotoxin c istrepi 
pyogenes . . . 


rn 


Be ~ 0 8 


sp|P13380|SPEC 5TRPV 


EXOTOXIN TiPE C PRECURSOR C'SPH CJ 




Be-OB 


gi |529754 


(U0255 9; spec LStreptococcus pyogenes 


56 


2e-07 


pir | |A3U5D9 


exotoxin c precursor - streptococcus 
>gi|l... 


56 


2e-07 


gi|b297bb 


;uo2560i spec streptococcus pyogenei 


55 


4e-0 7 


pir | | 527240 


enterotoxin B - stapnyiococcus 
aureus (fragments) 


53 


le-U6 



[0186] Shuffled recombinant clones are initially selected by phage display and/or screened by ELISAforthe presence 
of multiple epitopes from the different families. Variant proteins with multiple epitopes are purified and used to for in 
vivo screening as described above. The mouse sera are analyzed for antibodies specific for different toxin subtypes 
and variants that elicit broadly cross-reactive responses will be evaluated further in challenge models. 

2. Escherichia coli and Vibrio cholerae 

[0187] This Example describes the use of DNA shuffling to obtain cross-reactive multivalent polypeptides that induce 
an immune response against the E. coii heat-labile toxin (LT), cholera toxin (CT), and verotoxin (VT). Nucleic acids 
that encode cholera and LT toxin B-chains are subjected to DNA shuffling. Table 3 shows the results of a BLAST search 
using the V. cholerae toxin B-chain to identify homologous genes that can be used in a family shuffling format to obtain 
improved antigens. Homologous antigens have been cloned and sequenced from a number of related yet distinct Vibrio 
and E, coli strains, and additional natural diversity can be obtained by cloning antigen genes from other strains. These 
genes and others or fragments thereof can be cloned by methods such as PCR, shuffled and screened for improved 
antigens. 
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Table 3 



Sequences producing significant alignments 






Database/Accession No* 


Gene 


(bits) 


IT 

Value 


sp|P01bb&| CHTBJ/IBCH 


CHOliBRA enteroi'uXIN , BETA CHAIU PRECt 


2b2 


5e-67 


gi | 4b890 


1X587 Bb) cholera toxin B protein (CTI 
cholera . . . 




be-t>e> 


gi | 75b:4bl 


utuuivi; ctx a [viario cnoieraej 


246 


lie- 6b 


pri 


| 1UU1196A 


toxin, cnoiera iviorio cnoieraej 


246 


ie-6b 


gm 


PlD|dl006S53 


tD3 0U5ij; cnoiera toxin ivicno cholei 


244 


le-64 


pir 


|XVVCB 


cnoiera enterotoxm cnain b precursor 
cholerae 


241 


ie-63 


gi | -iuy^bb 


<M23 05U> cnoiera toxin suounit B prec 
[Artificia. . . 


22B" 


7e-60 


bbs|l£>B00b 


cnoiera- HKe enterotoxm a summit i\ 
cholerae, . . . 


211 


ie~54 


spjP13Bll|KUiH KCOL1 


HKAT-JjABIIiK KMT1JKUTOX1U B UHA1K PRECl 


209 


t>e-b4 


pdb|lXTC|U 


"Vibrio cholerae >gi | lB27Hbl | pdb | 1XTC 
choler. . . 


207 


iie-hJ 


pdb|IFUB|U " "' ^ 


viono cnoierae >gi 1 1942S3 9 | pdb 1 1FGB 
choler . . , 


207 




pdb|2CHB|D 


unain u, cnoiera Toxin B-pentamer Cor 
With Gml . . . 


207 


2e-53 


pdb|lCHP|U ~ 


Viono cnoierae >gi ) i4ii ±z> \ pdb 1 1CHP 
choler. . . 


20b 


le-b2 


pdb|ICHQ|D"" 


vinrio cnoierae >gi j i42ib^b j pdb | 1CHQ 
choler. . . 


20b 


le-52 


pdb|lCTl|D- 


unam u, unoiera Toxin B-Pentamer Mut 
Bound . . > 


204 


"le-52 1 


sp]P32H90|KLBP ECOLI 


HEAT-LAB1LU KAITSKOTOXIN B CHAIN PKSq 


204 


2e-S2 


prt | [D701264A ' ' 


toxin iiTB cis tron, neat laone tsscnea 
coli] 


201 


9e-52 


pir| ImjisuB 


neat-iaone enterotoxm cnam b preci 
Escheric . ♦ . 


201 ■ 


2e-5i 


bbs | 131495 1 


tS6073i) neat-iaone enterotoxm B si 
B SU. . . 


200 


2e-bl 


prt | |770iyUA 


toxin ivinrio cnoieraej 


199 


6e-51 


pdb|lLTA(U ■' ■' 


sscnencnia con >gi 1 4y42&6 | pdb 1 1LTA 
Escherichia c . , . 


179 ' 


4e-4b 



Sequences producing significant alignments 






Database/Accession No* 


Gene 


Score 
(bits) 


E 

Value 


pdb|lTKT|P 


viDrio cnoierae 


34 


0.31 



[0188] Those chimeric toxins that elicit high levels of neutralizing antibodies against both toxins and have improved 
adjuvant properties are identified. For example, shuffled clones are selected by phage display and/or screened by 
ELISA assays for the presence of epitopes from the different parental B-chains. Variants with multiple epitopes are 
purified and further studied for their capacity to act as adjuvants and to elicit cross-protective immune responses in 
challenge models. 

Example 2 

Evolution of Broad-spectrum Vaccines against Borrelia burgdorferi 

[0189] Lyme disease is currently one of the fastest-growing infectious diseases in the United States. It is caused by 
infection of the spirochete bacterium Borrelia burgdorferi, which is carried and spread by the bite of infected ticks. Early 
signs of infection include skin rash and flu-like symptoms. If left untreated Lyme disease can cause arthritis, heart 
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abnormalities, and facial paralysis. Treatment of early Lyme disease with antibiotics can stop the infection, but a lasting 
immunity may not develop making reinfection possible. A current vaccine requires three immunizations over a 1-year 
period to acquire immunity. 

[0190] Both passive and active immunization with the purified B. burgdorferi outer surface protein A (OspA) protein 
has been successful in protecting against infection with B. burgdorferi, but has no effect against ongoing infections, 
since this antigen is not expressed in vertebrate hosts. OspA is normally anchored on the outside of the cell by a 
covalently attached lipid moiety through an amino terminal cysteine residue. In contrast, the outer surface protein C 
(OspC) is highly expressed by the spirochete in vertebrate hosts and vaccination of infected individuals with OspC 
may be an effective therapeutic in curing the infection (Zhong etal. (1 997) Proc. Natl Acad. Sci. USA 94 1 2533-1 2538. 
[0191] A recent BLAST search (Altschul, etal., (1997) Nucleic Acids Res. 25:3389-3402) of the non-redundant Gen- 
Bank, PDB, SwissProt, Spupdate, and PIR databases was used to identify homologues of the OspA outer surface 
protein gene. This resulted in the identification of over 200 entries related to OspA. One hundred entries are shown in 
Table 4 below from different strains of B. burgdorferi. B. garinii. B. afzelii, B. tanukii, and B. furd/that share at least 83% 
DNA sequence identity to the Borrelia burgdorferi OspA protein. The ospA genes from these and other strains provide 
a source of diversity for family shuffling to obtain improved antigens for the prevention of Lyme disease. These genes 
are cloned by methods such as PCR, shuffled and screened for improved antigens. 
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Table 4 



Sequences producing significant alignments 






Database/ Accession No. 


Gene 


Score 
(bits) 


E 

Value 


dbj|ABai6yV7|AS0165»77 


Borrelia sp. gene tor outer surrace 
prote . . . 


iot 


-0TO 


db3 |ABDlfiS7B|AB0I"697B " 


Borrexia sp. iumt gene ror outer 
surface . ■ . 


1614 


□ . 0 


db3 |AB0Iby7b |AB01*975 


aorrena turai gene ror oucer sure ace 
pro ... 


1H26 


0.0 


"aETj"! AT3'0 1 6 976' ] ABO lb 97 fc 


prote , . . 


1187 


"0.0 


"gb | S4 83 23"] 34 83 23" "" 


ciKTiAsnucer suriafp rifot-pin a i Hnrrp 1 is 1 
burgdor. . . 


y4B 


0.0 


gD | b J B 6 b 7 | BUKfrKA 


Borrelia Durgaorreri tcione wjj ospA 
crene f raa . . . 


y4B 


0.0 


embjXBOlBS |BBPTKOOPS"" "" '"" 


b . Durgaorr en Pxro ospA gene 


W8 


u. u 


emb|Xbbby8|BBUSPAl' 


B.Durgaorreri usp a gene (tku; 


y3B 


0,0 


gb j U^U J'5"77"BBU20'3S7 


Borrelia buraflort en r-i-n outer 

surface pr. . . 


876 1,1 


0.0 " 


gb | fSBSCTS — 


mit't*!" Riirtace DT'otein a 1 Horrp 1 is "" ™" 

burgdorferi, . . . 


Bbb 




" effib | X5 5 0 5 5'f'BBUSPftOA 


B . Durgaorreri uspA gene tor outer 
surface p . . . 




0.0 


emb |X85440| BGTISOSFa 


B.garimi ospA gene (Tlsl substrain) 


ajy 


0.0 


emb|XB543B|BAPLJ05PA"" 1 


B.arzem ospA gene {vuyt suostrain) 
>gi | 9 . . . 


"537 


o:o 


e mb | X B 0 1 K ' 1 'BEFBOOSPA* 


b .Durgaorreri i-'Bo ospA gene 


829' 




emb j 229087 j BBOSPAY 


B.Durgaorreri iVB4tsij uspA gene tor 
outer su . . . 


'821" 


0.0 


emb|XHbyU2 IBADNAOSPA' 


B.arzein ospA gene 


uai 


0.0 


emoixb^lbl |BBUBPAU 


B.Durgaorreri piasmia ospA gene tor 
outer su. . . 


BZ1 


o.o 


gb|U7B301|HBU7B301 


Borrexia arzem major outer membrane 
surf ac . . . 


B21 


0.0 


embjXfebbyy|HBOSPA2 


B.burgdorterx Osp A gene {FKOJ 


815) 


0.0 


emb ! X7TT3 - 5"b""] BH0P5AA 


B.Durgaorreri uspA gene 


BI9 ' 


0.0 


emb 1 X854 j y | BAPLUOSPA 


B.arzeiii ospA gene IPL'ud substrain) " 


B1J 


0.0 


emb|X8IU47|BBOPSA~ 


B.Durgaorreri piasmia uspA gene 


B13 


u.o 


gb |V2035b |BAU203bb 


Horre I i a atzeln W\7 1 miliar* mir^aro' 

protein A. . . 


813 


u . u 


emb | X U b 4 3 7 | BAPH005PA"" " 


h ar2ein osda aene ipho subs train \~ ' ' "~ 1 


757 


0.0 


emo | xb u^b J | tf bPWUULL 


B.burgaorteri fwuaix ospA gene 


Tsn — 


"TT."0 


emb'j 22908b |BH05PAX 


b . ourgaorten tti^bj uspA gene tor "'outer " 
surf . . . 


vyi 


0 . 0 


gb|Uy7U2|BUHMAJOSPR " ' 


Borrelia burgaorreri outer surtace 
protein. . . 


791' 


0.0 


emb!X623B7|UBSPA ' 1 


B.Durgaorreri ospA gene tor outer 
surface prot . . . 


/By 


0.0 


emb|Xt03U0!BSA5PA ' - " 


B.Durgaorreri gene ror uspA outer 
surface pro. . . 


■78b 


0.0 


emb|X62b24 IBBK480SFA 


B.Durgaorreri ospA gene 


767 


0.0 " 


gb|HBB7b4 IBOROSPABA " " " 


Borrelia ourgaorren operon ma3or outer 
mem ... 




-d:o-' ■ 


embtX&3417IBBPUBPA 


B.Durgaorreri piasmia ospA gene tor 
outer su . . . 


vby 


0.0 


gbjLJbUJfclBGROSPAL- "' 


Borrena Durgaorreri outer surface 
protein A. . . 


743 


0.0 
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Sequences producing significant alignments 








VJvUC 


Score 
(bits) 


E 

Value 


gbIU2U3bB|BGU203bH 


Borrexia garmn x,v4 outer surrace 
protein A. . * 


743 


o.o 


'emb|X8b'442|BB2y705PA 


B. Durgaorrerx ospA gene i^y/ suostram) 


714 


"0.0 1 


gb|L19701|BOKOPBAB 


Borrexxa Durgaorrerx ma 3 or outer 
surface pro. . 


714 


u.o 1 


emb|X!4407|BBUb!PAB 


Borreiia Durgaorrerx ospA ana ospa 
genes for. . . 


706 


0.0 - 


gb|AK000790|AE0007yO 


Borreiia Durgdorrerx piasmia xpb-4, 
complet . . . 


7U6 


0.0 


gblU , 2036U|BBU20360 


Borrexia Durgaorrerx 5-1-iu outer 
surface pr . . . 


7U6 


0.0 


ernb|X69606|BBKA0SPA 


B. Durgaorrerx usda gene 
>gi|1819262|gb|I284. . . 


706 


0.0 


clb3 |AB0071UO|AB007100 


Borrexxa garxnxx gene ror outer surtace 
p. . . 


702 


0.0 


gb|Mb724B|BUUOSPA " 


B.Durgaorrerei outer sun ace protein a 
(OspA) ... 


'698 


0"0 


embTXBOlU^'l BBPKAOP&A 


b. Durgaorrerx t>Ka ospA gene 


698 


0.0 


OCJ IAB0071U1 IAB007101 


Borrexxa garxnn gene ror outer surrace 
p. 


694 " " 


0.0 


emb|XBb44J|BBT2b'OSPA 


b. Durgaorrerx ospA gene iTZbb 
substrain) 


692 


0.0 


emb|X16467|BBOyPA 


Borrexxa Durgaorrerx uspA gene ror 
outer surf ... 


69U 


u.o 


gb|U203b9|BBU20359 


borrexxa sp. iiVb outer surrace prdtexn 
A pre — 


690""" " 


0.0 


gb|A*'0260b9 | AF0250b9 


aorrexia Durgaorrerx bu ku& plasraxd 
lipopr. . . 


682 


0.0 


emb 1 Xa b7 J y | BBDNAOSPA 


b. Durgaorrerx ospA gene 




u . u 


emb | XB b44 1 1 BUWABOSPA 


a.garxnix ospA gene iwabsou suostraxn) 


57b 


0.0 1 


gb|imvuy |U93709' 


Borrexxa garxnxx outer surrace protexn 
A (ospA. . . 


674 


u.o 


db] |ABUUV0yy|ABU07099 


aorrexia garxnix gene ror outer surrace 
p. . . 


662 


0.0 


emb | XB02S 1 1 BBPHKIOSP 


B.ourgaorreri ftiex ospA gene 


66ir ' 


0.0 


go)U4yiyu Js(iu49190 


Borrexxa garinxx ma3or outer memDrane 
surf ac. . . 




0.0 


etnb|X6b600|BBOSPA3 


B. burgaorreri Usp A gene tHtlJ 


6b2 


"0.U 


db3 |U29660|U2y660 


Borrexxa Durgaorrerx gene ror 
outer surf ace pr. . . 


652 


0.0 


db 3 |AB015979|ABD16979 


Borrexxa vaxaxsxana gene ror outer 
surf ac , . . 


648 ' 


0.0 


db D lAB007I09|AB0U710y 


Borrexxa garxnxx gene ror outer surtace 
p... 


646 


0.0 


gb|U93707 (U93707 


Borrexxa garxnxx outer surrace protexn 

A (OSpA. . . 


644"' 


0.0 


dbj | AB007102 (AB0071TJ2 


Borrexia garxnxx gene ror outer surtace 
p... 


642 


0.0 


db3|ABD01C4I , [ABD'0rOTl 1 ™ 


Borreixa garinn UNA tor outer surtace 
pr. . . 


636 


e-180 


db3 |ABUU7U4 |AB007114 


Borreixa garxnxx gene ror outer surrace 
p. . . 


632 


e-179 


db3 (ABDU7105TABOD71TJB " 


Borrexxa garxnxx gene ror outer surtace 
p.. . 


T32 " 


'e-179 


go| uyj/iu |u93V10 


Borreixa garxnxi outer surrace protexn 

A (OSpA. . , 


bjib 


e-X7b 


"aUb3 | AB007106 IAB007106 


Borreixa garxnix gene ror outer surtace 
p. . . 


624 1 


e-177 


db 3 | AB0U71U4 (AB007104 


Borrexia garinxx gene tor outer surtace 
p.. . 




e-x / ; 


gb|U93706|U93706 


Borrexxa garxnix outer surrace protexn 
A (oapA. . . 


620 


e-176 


emb|XB0256 I SHPBIJDSPS" 


B.gamxx FBr ospA gene 


61J 


e-173 


"'db3 l |AB0071UB|ABD , D71DB 


Borrexxa garinii gene ror outer surrace 
p. . . 


60 7 


e-171 


~gb | 1.81129 I BOROSPAY 


Borreiia burgdorferi (isolate 2-14yy 
297) ou. . . 


bUb 


e-171 ' 
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Sequences producing significant alignments 






Database/ Accession No. 


Gene 


Score 
(bits) 


E 

Value 


gb|U9JVU|Uyj/U 


Borrelia garinii outer surtace protein 
A (ospA* • • 




e-171 


ODJ |AB00710J JABUU 710 S 


Borrelia garinii gene tor oucer surtace 
p. - . 


bBV 


e-JLbb 


go [ libll^B | BORUbFA^ 


Borrelia burgdorren i isolate i-iwa 
Son 188 . 


b'Bl 


e-164' 


gb|L231J7|BOKOi!PAC 


"Borrelia 'Burgdorferi (27yBbCT2J OspA 


577 


e-ibii 


gb|L23IJy IBUkOSPAJ*: 


Borrelia burgdorteri {4'Zj~/3tiX3) ospA 
gene t 3 - * • 


b7V 




gb|L^142|B0kUUyAl 


"^Borrelia ourgaorren ica_w uspA gene, 
3 ' end 


K77 




y D { Ja^C jIjo 1 OUKUo trnn 


OiJi, Leila UUi^Uuiicii lui?s>i^ / WoJ/n 

gene , 3 1 e . . . 


H77 


e-lfia 




UftV* 1 **^ 1 i a na t 1 n i i mil - pr ^nrtarp rirrsf" pm 
A (ospA. . . 


b&y 


e-UO 


gu | JUa J J-tU 1 -DWrtkJO ri\c 


gene , 3 ' . . . 




e-lfoU 1 




Unrrp I i a etai-l m i nP'np tnr niifpr i >*4- a r< c> 

P. . . 


bfcb' 


e-lby 


ah 1 1,2 14 14 '-t 1 MnftrTTPKT' 


3 » end , . . 


S61 


e-lbB 




p. . . 


b57 


e-lbb" 


ClUf I AO U« Dt* j .DVJ 1 ^. D cAi 




bbV 


e-lbb 


9b|U93VOB|UWV0B 


aorreiia garinii oucer surtace procein 

n V VJCIJUA • * * 


HS3 


e-155 


OD3 j Abuo7i07 |abuu ;Iuv 


sorreiia garinii gene ror outer surtace 
p. . . 


l>49 


e-lM 


g-b|L23141|BOROSPAH 


Borrelia surgaorren iiii4iwi) uspA 
yene, j ... 


S4b 


e-lbJ 


db3 |AH0U71II|AB007111 


Borrelia garinii gene tor outer surtace 
p. . . 


541 


e-I'!>2 


Vi* >J 1 AOU V> 1 11J | null W ' JtiJ 


p. . . 


b37 


e-lbl 




3 1 end ... 




e-14B 


ah 1 TT7H5'4 5 1 RRIT7Rt;4 q 


ti Li 1. iav> * • • 


52b 1 


e-147 


db3 | AHtTUyBfci | ABUUyH63 


borrena garinii gene ror outer surtace 

y , , , 


519 


e-14S 




b . Durgciori er 1 uspA gene ror outer 
surface p. . - 


4Sb 


e-U9 


db} |ABUOyBb^|AH009BG2 


Borrena garinii gene tor outer surtace 
p . . . 


45£ 


e-129 


emb | X9£3bU | HBUaPWKA 


b.Durgaorren ospA gene ( strain ptra; 


46U 


e-127 


ettUD | X6ttb4 1 | BBPHtll 


B.tmrgaorreri ^heij piasmia uspA gene 
for ou. . . 


446' 


e-1^3 


eihb|JU>Bb4U|UBPUUm 11 


B.ourgaorteri tfwuai) piasmia uspA gene 
for ... 


414 


"e , -114 


emb JX9535B | B0OSPPLI 


B.garinn ospA gene t strain i^iiij 


414 


e-114 


db^ |ABUEjyB&U|AB0D9S60 


Borrena garinii gene ror outer surtace 
p.. . 


J9J 


e-107 1 


db3|ABD09Hb'B|A500955B 


Borrena garinii gene ror outer surtace 
p.. . 


Jfes 


Be-y9 


db} |AB0U9Htl|AB0O!JBCl 


Borrena garinii gene ror outer surtace 
P- - • 




^ie-74 



[0192] A BLAST search with the B. burgdorferi OspC protein gene revealed over 200 related entries. Entries for one 
50 hundred sequences sharing at least 82% DNA sequence identity are shown in Table 5 below that provide a source of 
diversity for family shuffling to obtain improved therapeutics in the treatment of Lyme disease. These genes are cloned 
by methods such as PCR, shuffled and screened for improved antigens. 
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Table 5 



Sequences producing significant alignments 






Database/ Accession No. 


Gene 


Score 
(bits) 


£ 

Value 


gb|U04^82 |BBU042B2 


Borrexia ourgaorreri omf synthetase 
(guaA) g. . . 


X^faX 


Trn ■ 1 


gb 1 1,4 2 H 9 B | HOkJIOaPC 


Borrelia ourgaorreri i strain ^buibj 
outer s . , . 


1124 " 


"0.0 


gb|UUlB94 [BBU01894 


Borrexia ourgaorreri b_ji outer surrace 
prote ... 




e-lVfe 


db j 1 1 D4 94 9 7 J HUUOKPCA 


Borrena ourgaorreri gene tor outer 
surface . . . 


622 


e-17i> 


gb| AE0UU792 |AE00U7y2 


borrena ourgaorreri pxasmia cpzb, 
cotnplet . . . 




e-176 


emb|X6yb96|BBU310yPC 


B. Durgaorteri ospu gene lor outer 
surface . . . 




e-174' 


gb| AFU29B60 1AF029B60 


Borrexia ourgdorren ocx outer surrace 
pro . . . 


b23 " 


e-146 


gb|liyi79B |BBU9179B 


Borrexia Durgaorteri strain L.b outer 
surface. . . 


biy 


e-14b 


gb | L4 2 B y 7 | HUR2 OOSPC 


Borrexia ourgaorreri i strain xp^; oucer 
sur 


S17 


e-14b 


gb|LS1131|BOKOSPC¥ 


Borrexia ourgaorreri; suostram sensu 
strict. . . 


509 


e-142 


cfh|U91792'|BBU91792 


Borrelia ourgaorreri strain wxx outer 
surfac. . . 


T7B 


e~ 13 3 


gb|U91797|BBU91797 


borrexia ourgaorreri strain xfj outer 
surf ac . . . 


"4 k 2 


e-12B — 


go | uyj-aui | bauy lbui 


Borrexia ourgaorreri strain Fit' outer 
surf ac * . . 


444 


e-123 — 


OD} (ABUUXJ7 7 |ABUUXJ77 


Borrexia ^aponica strain N06 / una ror 
Out . . • 


430 


e-TTET 


db} | AB00137B |AB00137B 


Borrelia japonica strain OvKK.7 DNA tor 
Ou . . . 


"430" 


e-HB™ 


emb ! XB47S3 | BBOSPCTXW 


B. ourgaorreri ospu gene t strain txuwj 


418 " 


"e-'llB 


OO} |ABUUU.ibb [ABUULUbb 


Borrexia canuxn una ror outer surrace 
pr. . . 


41B 


e-115 


go j usi /99 {BBuyivyy 


Borrexia ourgaorreri strain ipx outer 
surf ac . • . 


41B 


e-llb 


db3|AU00137t>|AB001376 


Borrexia ^aponica strain F1310 una ror 
uu . . - 


414" 


e-114 


emb|X73G24|BB03PCC 


B .ourgaorreri [uxib) ospc gene 





e-m — 


erao | At>^i62 | bbfcg 


a * ourgaorreri gene ror pt; procein 


3yy 


e-ioy 


etnb|Xb9b9l>|UliWUD0SPC 


b. ourgaorreri ospo gene, 3' end 


3 y b 


e-T0'9~ 


emDiAHl521 |B AOS PCI 


B.arzexn istrain fboi ospu gene 


377 


e-102 


OOJ |ABUUU34b |ABUUUJ4b 


Borrexia arzexn una ror outer surrace 
pr. . . 






'■db 3 |AB00990U|AB009900 


Borrexia arzem gene ror outer surrace 
p. . . 


■3 O 4- 


"HJe'^DB — 


dbj |ABU09Byy|AB009B99 


Borrexia arzexii gene ror outer surrace 
p. . . 


jLj 1 ; ■ - 


■ l'e"-9? — 


emb |XB 1523 |BAOSPCU 


B.arzexn (strain fio/j ospo gene 






gblAFU29B71IAF029B71 


Borrexia ourgaorreri ocia outer surrace 
pr. . . 


337 


le-90 


db;>|AH009Bg7|ABU09897 


Borrexia arzexii gene ror oucer surrace 
p. . . 


337 


le-90 


00} | ABUUU3b4 [ABUUU3&4 


Borrexia tanuKii una ror outer surrace 
pr. . . 


337 


ie-yo 


"gb|L2S413|BOROSPC 


Borrexia ourgaorreri memorane protein 
(ospc) . . . 


33b 


be-90 


dbj |l>49b02|HOROSPO? 


Borrexia arzexii gene ror oucer surrace 
pro . - . 


33b 


be- 90 


emb |XB35bb | UUUNAOtiPC 


b. ourgaorreri ib. paciricus strain; 
ospC gene 


333 


2e-B9 
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Sequences producing significant alignments 








Gene 


Score 


E 

Value 


gb|L42U74 |BOKlUObiPC 


BorreXia Durgaorten (strain urtn> 
outer su. . . 


331 


8e-yy 


-dBTTW'9S03 | SOKOSPCG 


borrexia aizem gene tor outer suriace 
pro ... 


331 


Be- by 


db3 |AB0uyyy4 1 AuauyBy4 


Borrexia arzeXn gene tor outer surrace 
p. . . 


32y 


ie-btt 


gb|L42U90|BOR23OSyc" 


Borrexia Durgaorreri [ scram abij outer 
sur, . . 




Je-BB 


gb|L42H92|BOK250SFC 


BorreXxa Durgaorreri [strain acax; 
outer su . . . 




Je-BB 


db;j |myb01|BOROSPCK 


BorreXia arzeni gene tor outer surrace 
pro, , . 


327 


le-87 


gb|U04240|BBUU424CJ 


Borrexia Durgaorreri ump syntnecase 
(guaA) a. . . 


31V 


le-84 


gb|U042B0lBBUu42BU 


Borrexia Durgaorten gmf syntnetase 
(guaA) g... 


317 


xe-B4 


db3 | ABuuyyoi | AHOuyyoi 


Borrexia arzeni gene tor outer surrace 
p. . . 


1 3u9 


" 3e-82 


db} !AB009893IAB009By3 


Borrexia arzeni gene tor outer surrace 
p. . . 


309 


3e-tJ2 


gb|U2BB3 IBOK170SPC'" 


Borreiia Durgaorreri (strain jsbj outer 
sur . , . 


iOV 


ie-Bl 


gb|U042Bl!BBUu42Bl 


Borrexia Durgaorreri hbxh outer surrace 
prot . . . 


JUb 


be-8X 


db} i AB00989b TABOOS 96 


Borrexia atzexn gene tor outer surrace 
p. . . 


30b 


be-81" 


emb|X81b22|BB05PCl 


B. Durgaorreri (strain PBrej ospc gene 


297 


le-7B 


clb^ | ABUUUJ4y | AB000349 "" 


Borrexia arzexn una tor uuter surrace 
pr. . . 




le-7B 


emblX73625|BBOSPCi5 


B.burgaorreri (Die/) uspc gene 


297 


le-VB 


dbj jD49b09|BOR0SPCM 


Borrexia garinn gene tor outer surrace 
pro . . . 


:29b 


4e-va 


dbj |AB00034b■|ABU00^46 , 


Borrexia arzexn una tor uuter surrace 
pr. . . 




2e- / / 


gb|AF02yy70|AF029B70 


Borrexia Durgaorten ocix outer surrace 
pr. , . 


2«y 




gb|L428yb|BOK2aOKPC"' " 


Borrexia Durgaorten (strain ^bib4j 
outer s . . . 


289 


ie-7S" 


emo i Aa xb^4 | BbusFC^: 


b .Durgaorreri ( strain T2bb) ospc gene 


2THJ 




db;j |uBB29<i|UllB296 - 


Borrexia atzexii zbKb circular pxasmia 
DNA fo. . . 


28y 


3e-75 


embiX8152b|BUOSPC2 


B.garinii (strain wabsouj ospc gene 
>gi|8720, . . 


2Bb 


4e-'7b 


emb | XB4772 | BHOBPCUJ2 


b.garmii ospc 1 gene (strain utLid) 


28b 


4e-/b 


db 3 |ABO0yy02 | ABU099U2 


borrexia arzexn gene tor outer surrace 
p., . 


2*5 


4e-/b 


ctto|U4i»S05|B0RDSFCI ' ' 


Borrexia garinn gene tor outer surrace 
pro 


281 


ve-74 


gb | U01892 |'BBU0TH32 


Borrexia Durgaorreri *dbyi outer surrace 
prot . . . 


^ax 


/e-/4 


e mb | X B J b b 2 | B ADN AU B P C 


B.arzexn (iM^ua strain j ospc gene 


280 




db 3 |U4937B|aURDSPC6-4" 


borrexia ganmi (strain KT64) ospc 
gene f . . . 


27B 


le-72 


■"abj | D49379 |B0ROSPCVS- 


borrexia atzexii (strain vs4bij ospc 
gene . , . 


274 "-' 


^ie- /l 


gb| AFu29Ufc4 |AF02yB*i4 


borrexia Durgaorreri ucb outer surrace 
pro . . . 


265 


4e-&9 


dbj | AB0003bO | AB00D35TT — 1 


Borrexia arzexn una tor outer surrace 
pr. . . 


2bb 


4e-ty 


gb | UUB^b4 |bBUUiiilB4 


borrexia. Durgaorreri 2? / outer surrace 
prote. . . 


264 


2e-bB 


db;j | ABO0U34 J | ABD003TJ' ' 


borrexia atzexii una tor outer surrace 
pr. . . 


260 " 


V'e-67 


db;j [ ABUU03b3 IAHUD035T " 


borrexia tanuKii una ror outer surrace 
pr, . , 


2bb 


le-t6 


emb I XS4779 I BHUSPCMDL 


b.Durgdorteri ospc gene (strain MULJ 


2bb 


4e-fc£ 


erablX84768IBHOSPCDrS ' 


a. atzexii ospc gene (strain uicib) 
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Sequences producing significant alignments 






5 


Database/Accession No. 


Gene 


Score 
(bits) 


Value 






DNA fo. . . 


254 


2e-fe5 






nnrrp i la aarmii aenp rnr our pr ^urtarp 
pro . . - 


254 


ide-b5 






B.garinn (NJ4 strain) ospCi gene 


250 


-de-64 


1 0 


gb|AF029B&5|AF029S6fc 


Borreiia uurgaorr er x uuv outer surface 

r iW * * * 


"2SS 


2e-64 




dh'j [DBB294 |DBB294 


yorrelia garxnxx 26KD circular piasroid 
DNA fo, . . 


f~2W ' 


2e-£>4 




" gb"I L4 12 B B 8 JB0K2 10SFC ' ™ 


Borreixa burgdorferi (strain Hyj outer 
surf ... 


24B 


9e" 6'4 


15 


gbl AFC72 9 8F2 "J EFU2 9 552 


Borreiia ourgcsorieri OCJ outer surface ' 
pro . , . 


24t 


4e-63 




""abyT AOTU'9F9T |ABPU9B91 ' 


Borreiia afzelix gens tor outer surface 
p... 


24f 


4e-fc3' 




db^ ] AB009B9U )AB009B9B 


Borreiia atzelxx gene tor outer surtace 
p. . . 


24b 


4e-63 




gbiAK02?B51 | ATQ29H61 — 


Morir&l la i}iiiroci027£ eri oli^ ourpr ^nrtanp 

pro ... 




4e-63 


20 


en\blX&9by3 [ BBTMOBPl' 


b . Durgdorxerx ospc gene, 3' end 


246 


4e-bJ 




aa;) |U4yi;v|B0R0SPCbV 


Borrexxa garxnix i strain htsV) ospC 

ggftg f , . . 


"T44 '" 


Te^GT" 




do 3 1 D49500"T"BUUO&if CU' 


Borreiia Qarinn cien^* rmr nut* pr QiiT*"i ; 'ar , <3 

Dro . . . 


"231 


le-C2"" 




go|l,42896 |B0R2905PC ' 


Borreiia rjurgaorieri [ strain ^/b79J 
outer s . . . 


242 


€e-€T2"" 


25 


db^ | U4yj fb | BURoapcrc 


Borreiia garinn t strain tulbkj ospc 
gene . . . 


240 


2e-fil" 




gb|L.42B7J |BOR90SPC 


Borreiia ourgaonen ts'trairi SIMON J 
outer su. * . 


23B 


ye-err"" 




db3 |U4»3B1|U0RDSPC37 


Borreiia garinn i strain htjVJ ospC 
gene f . . . 


2TB 


ye-fci 


30 


"ctb^ | U4 94 9 U | BOKOBP CB 


Borreiia garinn gene tor outer surtace 
pro . * • 


23 B 






emb | X6 9 b y 2 | BBT2 bOB PC 


B.ourgdorteri OspC gene, J' end 




^ l e"- , 59 , ' 




emo i A59sy4 | BBPBROBPC 


B .burgdorferi uspt; gene, n 1 end 




9e-S"B 




cir^J|U4ybU6|BOROSi'CJ ,,,, 


Borreiia garinn gene tor outer surtace 
pro . . . 


220 


2e-55"" 




emo |X6 yb9b | BBPBIOSPC 


B.Durgaorteri ospc gene tor outer 
surface . . . 


22D 


"2e"-55 — 


35 


emb | X B 3 b b 4 | BC50BPPTRO" 


b. gar in ii iJfXroD strain; opsu gene 


220 


2e-55 




emu 1X73 626 IBS05PCE" " - 1 ' 


B.ourgaorreri [u&b) uspC gene 


22U 


2e-5b 




gb | L428V0 | BOR60SPC 


Borreiia ourgaorteri (strain VBUA) 
outer sur . . . 


220 


2e-5V ' 




g"b|l J 42By4|BUR270SPC 


sorrexia surgaorteri i strain 2B691J 
outer s • . . 


Zlb 


be-bb 


40 


db3|D49S04|BOROSPCH- 


aorrexia garinix gene ror outer surtace 
pro ... 


20B 


be-52 




gb|L42UfaU|HUU40SPC 


Borreiia nurgaorterx t strain itlV) outer 
surf ... 


2Ut 


3e-51 




"~ctbj|AB000JbU|AB0T>035B " 


aorreixa 3aponica dwa tor uuter surtace 
p. . . 


2U4 


le-bO 


45 


db 3 [AB000351 |ABUUU5bi 


Borrexxa 3aponica UUA tor Outer surtace 
p. . . 


204 


"le-50" 



Example 3 

Evolution of Broad-spectrum Vaccines against Mycobacterium 

50 

[0193] Tuberculosis is an ancient bacterial disease caused by Mycobacterium tuberculosis that continues to be an 
important public health problem worldwide and calls are being made for an improved effort in eradication (Morb. Mortal 
Wkly Rep (1998 Aug 21; 47(RR-13): 1-6). It infects over 50 million people and over 3 million people will die from 
tuberculosis this year. The currently available vaccine, Bacille Calmette-Guerin (BCG) is found to be less effective in 
55 developing countries and an increasing number of multidrug-resistant (MDR) strains are being isolated. 

[0194] The major immunodominant antigen of M. tuberculosis is the 30-35 kDa (a.k.a. antigen 85, alpha-antigen) 
which is normally a lipoglycoprotein on the cell surface. Other protective antigens include a 65-kDa heat shock protein, 
and a 36-kDa proline-rich antigen (Tascon etal. (1996) Nat. Med. 2: 888-92). 
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[0195] Table 6 shows the output of a BLAST search using the 30-35 kDa major M. Tuberculosis antigen (a.k.a. antigen 
85, alpha-antigen) coding sequence to identify homologous genes that may be used in a family shuffling format to 
obtain improved antigens. Many homologous antigens have been cloned and sequenced from a large number of related 
yet distinct mycobacterial strains. These genes are cloned by methods such as PCR, shuffled and screened for im- 
5 proved antigens. 



Table 6 



Sequences producing significant alignments 






Database/Accession No. 


Gene 


Score 

(DltS) 


E 

value 




Mycobacterium tuberculosis Jd JcUa 
extracellu. . . 


iy3y 


0.0 


emb|X623 u B|MT(U>U 


tuberculosis istrain Erdman) gene tor 
85-B a. . . 


iyjy 


0 . 0 


eiab|:WViy3|MTCYlU0 " 


Mycobacterium tuberculosis hj{ ^kv 
complete ge . . . 


lsj y 


0.0 


emb|£&2397|MBB5B 


m. dovis tstrain gene tor tib-B 
antigen 


1931 


"0.0 


gb|M21B3i)|Mi5(3BUtiA " 


m.dovis bcg gene encoding alpha - 
antigen, comp. . . 




0.0 


emb| JtbJUyy (MKAANTIU 


Mycobacterium Kansas ii gene tor alpha 
antigen 


fjiy 


u.o 


dbj JU2&1B7 IMBGAA7 


Myconaccerium scroiuraceum UNA tor 
alpha - ant ig. ♦ . 


70b 


u . u 


db3|nibb4fc|MSGAAG' 


Mycobacterium mtraceiauiare gene tor 
alpha- a... 


551 


e-164' 


embUbJ4J7|HAMART 


M . avium gene tor a ipna- antigen ' 


s&y 


"e-KO 


db} jlJUiJbli IMHUATCCJlJy 


Mycobacterium intracellulars UNA tor 
alph. . . 


533 


e-14y 


gb|LU1095|Mb!UB38CO£- | 


M. leprae genomic dna sequence, coamid 
B38 ... 


371 


e-iOTJ 


emb|X6Uyj4 IMbBbHA 


m. leprae gene ror bb-B antigen 


3fci 


4e-9B 


embjmfefcfc fMLt'HPAPK 


m » leprae ribronectm-biridmg protein 
antige . . . 


34V 




gb|M2701G|MSG32KDA 


Mycobacterium tuberculosis 32 kUa 
antigen gene. 


317 


2e-ST ■ 


emb , |XbJU34|MB32PG 11 


Mycobacterium dovis gene ror .iZKDa 
protein 


317 


*e-b4 


db 3 |D2fe4B6 |MSG32KDAP" 


Mycobacterium dovis genes ror J2Jtt}a 
protei . . . 


317 


2e-B4 


'emb|AL022U7MMTVa2G- 


Mycobacterium tuberculosis H3 7Rv 
complete g . . . 


317 


2e-U4 
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Sequences producing significant alignments 






Database/ Accession No. 


Gene 


Score 
(bits) 


E 

Value 


gb|U47JJb|MTU47JJb 


Mycooacterium tuoercuiosis 
extracellular 3 2 


31 J 


2e-y"4 


db3 | U j MtJUlitJUABSB 


Mycooacterium covis gene tor aipna 
antige. . . 


juy 


be- 82 


«nb|X10J7BlM(JHSAANT 


w.goraonae gene encoamg ub-A antigen 


303 


3e-t*u 


dfcQ |JD7B144|D78144 


Mycooacterium avium gene ror MFXbi, 
antigen 8 . . . 


258 


2e-bb 


gb|My064H (MyUHbAA 


Mycooacterium leprae bb-A antigen gene, 
compl . . . 


236 


be-bu 


dbj |lM3U41|M&ftiAti!>CA 


Mycobacterium leprae una ror antigen as 
com ... 


"222 


ee-56 


eRib|^2b&7|HHAK114V "" ' 


M.mannum gene ror 3 2 Kua protein 
(partial) 




"le-b-4 


em£> J Z J 3 6 b 8 f MA3 2KJP1 & 


m. avium (ATUC iyu7b> gene ror J^Jcua 
protein. . . 




/e-47 


cib^ |DB7323|L>a7323 


Mycobacterium avium gene ror ancigen 
85C and . . . 


186 


Se-4b 


etab|Zj36b7|MA32KPIS 


M. avium { atul" ib/&y; gene ror 32Kua 
protein. . . 


ltt4 


2e-44 


emb|ZJ36b2|M32KPllU 


'M. intracellular (AT(J(J USJbUJ gene tor 
32k. . . 






ento|X92bb<>|MASIA122 


M.asiaticum gene ror 32 jcDa protein 
(partial) 


16B 


ie-jy 


emb|YD771b|MAj21O>K01 


M.asiaticum gene segment or 32-KDa 
protein 


16 H 


le-jy 


emb|ZbU7bU|MA32K511 


M. avium complex gene ror 32 Kua protein 
(pa. . . 


l&B 


'le-J9 


emb|Z507b9|MA3>2K1112 


Ti. avium complex gene ror 32 Kua protein 
<p... 


167 


4e-3y 


emb)Z507b l 7|MAJ2K7b , 9 


m. avium complex gene ror 32 u>a protein 
(pa. . . 


HI 


Je-37 


emb|Zb0774|MA32K366 


m. avium complex gene ror 32 jtua protein 
(pa. . . 


161 


"Je-J7 


"emb|Z336by|MA32KPI7 " 11 


M. avium iatuu iyU74J gene tor 32kUa 
protein. . . 


161 


3e-37 


emb|ZJJbbl|M132KPiy 


m. intracellular e (atul: 3b/b2j gene tor 
32kD. . . 


161 


3e-^7 


emb|Z507b3|MA32K555 " 


m. avium complex gene ror 3 2 Jtua protein 
(pa. . . 


m - 


3e-J7 


e"mb|ZS07;2|MA32K961 


M. avium complex gene ror 32 Kua protein 
(pa... 


IbV 


4e-3S 


emb|ZbU7tb|MAJ2Kb7K" 


m. avium complex gene ror 32 Ki)a protein 
(pa . . . 


""lb 3 ' 


6e-35 


emb | ZbU77U | MA32K904 " 


M. avium complex gene ror 32 itua protein 
(pa. . . 


153 


be-Jb 


emb|Z33fcb;|MM32KE>I15 


M.maimoense gene tor 32KDa protein 
(partial) 


lbJ 


1,1 fee- 3 b 


emb|Zb'07by [MA32KB14 


m. avium complex gene ror 32 Kua protein 
(pa. . . 


lbJ 


6e-Jb 


emb|ZbCJ764|MAi2K575 - " 


m. avium complex gene ror 32 KDa protein 
(pa. . . 


14SJ- 1 


ie-33 


emb|ZSUU2|MAJ2K55S 


M. avium complex gene ror 32 Kua protein 
(pa. . . 


145 ' 


2e-32 


emb tJC5722y (MTUbCG ™ " 


Mycooacterium tuoercuiosis gene ror 
antigen 8 . . . 


14b 


2e-32 


" erab|Z$0 yfcll , HA32KS54 ~ 


m. avium complex gene ror 32 kUa protein 
(pa. . . 


14b 


2e-32 


emb|Zy2770|RTCI5 , 


Mycooacterium tuoerculosis H37kv 
complete geno. . . 


14a 


2e-32 


emb| Ay^bvu jMsZUUi 


M.s2uigai gene ror 32 Jdla protein 
(partial) 


141 


2e-Jl 


emb | Zb U 7 b « 1 HAi 2K1076" 


m. avium complex gene ror 32 xua protein 
(p... 


127 


4e-27 


emb|ZbU766|MA32K5T7 " 


M. avium complex gene ror yt xua protein 
(pa . , . 


'122' 


1 be-26 


emb 1 233 ^5 | ME32XFT2 


m.oovis isccij gene ror 32KL>a protein 
(parti . . . 


uy 


9e'-2b 



55 
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Sequences producing significant alignments 






Database/Accession No. 


Gene 


Score 
(bits) 


E 

Value 


"emir[X925V3|MT&IVI51 " 


M.tnviaie gene ror 3 2 Kua protein 
(partial) 


ioy 


ye-22 


embiXy^bBJ |MCJ2L11U<> 


M.ceiatum gene ror 32 Kua protein 
(partial) 


TOO " 


8e-iy 


emb | Z219bU (MLBSAPRA 


M. leprae or bbA protein gene 
>gi |287923 )emb. . . 


90 


Be-16 




Mycooactenum leprae cosmia H2fe DMA 
sequence . 


B8 


Je-lb 


emD | £219bl I MLiSbCPRA 


M. leprae or bbu protein gene 


HB 


3e-lb 


gb|Hy0649|MKU8!»CA 


MycoDactenum leprae a^-u antigen gene, 
compl . . . 


HS 


3e-l5 


enib|X92382|MCEL12Jb 


M.ceiatum gene ror 32 Kua protein 
(partial) 




l'e-14 


emblX92b77|MPHLE89 


M.pniei gene ror 32 xua protein 
(partial) 


82 


2e-U 


emb|X92bBl|MBKA1077 


M.Dranden gene ror 32 KL>a protein 
(partial) 


S2 


2e-l3" 


ento| r0771B|MF3i!KPRO4 


M.riavescens gene segment oi jJU-kUa 
protein 




2e-li 


emb|Xy2b7b|MKOKTUl 


M.rortuitum gene tor 32 XDa protein 
(partial) 




Be-13 


emb|Z33&fc3|MA32KPIll 


M. avium- mtraceiiuiare complex gene ror 
32. . . 




le-11 


emb|X9ab7t|HPISUliU2 " 


M.peregrinum gene ror 3-d Kua protein 
(partial) 


74 


be-11 


emb|Zb077fc]MAH04B94 


M . avium complex gene ror 32 Kua protein 
(pa. . . 


68 


Je-09 


emb|X92b71|MXKHU2Ul 


w.xenopi gene ror 32 Kua protein 
(partial) 


68 


3e-09 


emb|YD771V|Hii3aKPttUJ 


M.smegmatis gene segment or 3^-Kua 
protein 


66 


le-DB 


emb|Z5'07f»9|HA32KB2a 


M. avium complex gene ror 32 Kua protein 
(pa. . , 


50 


7e-07 


emb|ZS077S|MAH03W4 


m * avium complex gene ror 32 jo>a protein 
(pa. . . 


60 


7e-07" 


emD | xu v / iy | MV3^tu»KUb 


M.vaccae gene segment 01 32-Kua protein 


-54- ■ 


4e-0b 


emb|X925BU|HVACyi 


M.vaccae gene ror 32 Kua protein 
(partial) 


54 


4e*0b 


emb[Xy2b74|MNONCMb 1 " -"" 


M.noncnromogemcum gene ror 32 KDa 
protein ( . . . 


48 


0.UU3 


emb|Xy2bty |MyiM195 ' 


M.simiae gene ror 3-d Kua protein 
(partial) 


48 


D.003 


emb|AJUu21bU|MTAJ21b'0 


Mycooacterium tunercuiosis nJVkv, MPTb'l 
gene 


4b 


0.011 


emb|Z79700|HTCY10D7 ' 


Mycobacterium tuDercuiosis H3/kv 
complete g . . . 


44 


U.U43 


gb|MbU472|ATUCAT 


a. turner aciens cniorampnemcol 
acetyltransf eras . , - 


42 


0.17 


enb|X9257S|MtiMJ£ti90 ■' "" 


M.smegmatis gene tor i2 JtDa protein 
(partial) 


40 


0.66 


emb|Xy2b72|MTt:K2(>0 


M.terrae gene tor 32 kba protein 
(partial) 


40 


U.fab 


emblX92b6BlMSCUUl49 - -" 


m. scroruiaceum gene ror 32 JcDa protein 
(par... 


40' 1 


0.fc6 


ehibj^bbb |M(JJ2Kyil4 


M.goraonae iatCU 144 71) ) gene tor J2JtUa 
pro . . . 


40 


0755 


gb l MI7700 | FLCNPUA ™ 1 ™" 


mnuenza u/cai norma/ /a nucieoorotem 
RNA ( . . . 


3b 


2.6 



50 Example 4 

Evolution of Broad-spectrum Vaccines against Helicobacter pylori 

[0196] Chronic infection of the gastroduodenal mucosae by Helicobacter pylori bacteria is responsible for chronic 
55 active gastritis, peptic ulcers, and gastric cancers such as adenocarcinoma and low-grade B-cell lymphoma. An in- 
creasing occurrence of antibiotic-resistant strains is limiting this therapy. The use of vaccines to both prevent and treat 
ongoing infections is being actively pursued (Crabtree JE (1998) Gut 43: 7-8; Axon AT (1998) Gut 43 Suppl I: S70-3; 
Dubois etal. (1998) Infect. Immun. 66: 4340-6; Tytgat GN (1998) Aliment Pharmacol. Ther. 12 Suppl 1 : 123-8; Blaser 
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MJ (1998) BMJ 316: 1507-10; Marchetti et at. (1998) Vaccine 16: 33-7; Kleanthous et al. (1998) Br. Med. Bull. 54: 
229-41; Wermeille et al. (1998) Pharm. World Sci. 20: 1-17. 

[0197] Identification of appropriate Helicobacter antigens for use in preventive and therapeutic vaccines can include 
two-dimensional gel electrophoresis, sequence analysis, and serum profiling (McAtee et al. (1998) Clin. Diagn. Lab. 
Immunol. 5:537-42; McAtee et al. (1998) Helicobacter 3: 163-9). Antigenic differences between related Helicobacter 
species and strains can limit the use of vaccines for prevention and treatment of infections (Keenan et al. (1 998) FEMS 
Microbiol Lett. 161: 21-7). 

[0198] In this Example, DNA family shuffling of related yet immunologically distinct antigens allows for the isolation 
of complex chimeric antigens that can provide a broad cross-reactive protection against many related strains and 
species of Helicobacter. Mouse models of persistent infection by mouse-adapted H. pylori strains that have been used 
to evaluate therapeutic use of vaccines against infection are used to evaluate shuffled antigens (Crabtree JE (1998) 
Gut 43: 7-8; Axon AT (1998) Gut 43 Suppl 1 :S70-3). 

[0199] The vacuolating cytotoxin (VacA) and cytotoxin associated gene products (CagA) have been evaluated as a 
vaccine against H. pylori infection in animal models which supports the application of this approach in humans. 
[0200] Table 7 shows the results of a BLAST search using the H. pylori Mack gene to identify homologous genes 
that can be used in a family shuffling format to obtain improved antigens. Homologous antigens have been cloned and 
sequenced from a number of related yet distinct H. pylori strains and additional natural diversity can be obtained by 
cloning antigen genes from other strains. These genes and others or fragments thereof are cloned by methods such 
as PCR, shuffled and screened for improved antigens. 
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Table 7 



Sequences producing significant alignments 






uaiauase/ Accession i^o, 


Gene 


Score 
(bits) 


E 

Value 


c(b|U9b971|HPU95$m 


Helicobacter pylori 95-b4 Idiots) 
inactive cy. . . 


7B74" 


0.0 


gb|AK00059H |HPA*:000b9B 


Helicobacter pyiori section /t> or 
134 of . . . 


246b 


0.0 " 


gb|AF00135B |HPAF00135S 


Helicobacter pylon vacuolating 
cytotoxi . . . 


240* 


0.0 


emb|Z26B83 IHPCVTTOX 


H , pylori gene tor cytotoxin. 


23tjy 


0.0 


gb|U05€>77 |HPU0b677 


Helicobacter pylori 87 -^oj 
vacuolating cytot . . . 


23B9 


0.0 1 


gb|UO567fe|HFU0b676 


Helicobacter pylori fcoiyo 
cysteinyl-tRNA syn. . . 


23bb _ 


0.0 


gb | U2 y-4'0 1 ]"HPU2 94 01 


Helicobacter pylori vacuolating 
cytotoxin ho . * . 


234b" 


0.0 


-emb-|AJ006yt9|HP!«r€969 


Helicobacter pylori vacA gene, 
strain Mz28 . . . 


210'3' 


0.0 


gb|S72494 |U72494 ' 


14 u Kaa cytotoxin (.Helicobacter 
pylori , Genomi . . . 


2050 


0.0 


gb| U07145|HPU0714b 


Helicobacter pylori WCTU '"llb'JU 
cysteinyl tRN. . . 


20bU 


0.0 


emb|AJ00feyfe8|HP¥696H 


Helicobacter pylori vacA gene, 
strain Mz26 . . . 


2032 


u.o 


emb|AJ006970 jHPY6570 


Helicobacter pylori vacA gene, 
strain Mz29. . . 


1592 " 


0.0" 


gb"| AF077y3yjAF077939 


Helicobacter pylori strain ib6 
vacuolating . . . 


1B34 


0.0' 


gb|AF077940|AF077940 


Helicobacter pylori strain tjy 
vacuolating. . . 


1814 


0.0' 


gb|AF077941|AK077941 


Helicobacter pylori strain b4y 
vacuolating. . . 


17 7 B 


0.0 


gb|AF07793B|AF077938 


Helicobacter pylori strain t>u 
vacuolating . , . 


1746 


0.0 


gb|U632bb|HPU53255 


Helicobacter pylori vacuolating 
cytotoxin ge . . . 


835 


0.0 


gb|U£3270"lHPU63270 


Helicobacter pyiori vacuolating 
cytotoxin ge . . . 


Biy 


0.0 


gb|UU27T|HPU£3272 - 


Helicobacter pylori vacuolating 
cytotoxin ge. . . 


819 


0.0 


gb |U&i2B3|HPU63ZS3 " " 


Helicobacter pylori vacuolating 
cytotoxin ge. . . 


819 


o.o 


gb|U632B4|HPUfc32B4 


Helicobacter pylori vacuolating 
cytotoxin ge , . . 


819 1 


0.0 


gb|Ufc3262|HPU53262 


Helicobacter pyiori vacuolating 
cytotoxin ge,,. 


B03 


0.0 


gb|U6J2ty |^PU632C8■ 


Helicobacter pyiori vacuolating 
cytotoxin ge . * . 


803 


'"0 . o - 


gb|Ufc327J |HPUfc3273 


Helicobacter pyiori vacuolating 
cytotoxin ge.., 


B03 


0.0 


g5|U632B2|HPUb32B2 


Helicobacter pyiori vacuolating 
cytotoxin ge . . . 


803 


0.0 


gb!U6i259 IKPUS3259 " " 


Helicobacter pyiori vacuolating 
cytotoxin ge. . . 


79b 


0,0 


gb|U&327fc|HPU63276> " 


Helicobacter pyiori vacuolating 
cytotoxin ge. . . 


779 


0.0 


gb|U6J2B7|HPLJ63-2B7 


Helicobacter pyiori vacuolating 
cytotoxin ge . . . 


779 


0.0 


gb|U632fcX|"HPU£>32&3 


Helicobacter pylori vacuolating 
cytotoxin ge. * . 


771 


"u .O ' ■ - 


gb|U632&y |HPUfc3269 " 


Helicobacter pyiori vacuolating 
cytotoxin ge. . . 


771' 


TT.O 
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Sequences producing significant alignments 






Database/Accession No. 


Gene 


Score 
(bits) 


L 

Value 


gfc|U63286 |HPUb32be 


Helicobacter pylori vacuolating 
cytotoxin ge . . . 


"771' 


o.o 


gb|Ufe327i> |HPU6i27i> 


Helicobacter pyiori vacuolating 
cytotoxin ge . . . 


76i 


0.0 


go | ufe32/y |apufaJ2/y 


Helicobacter pyiori vacuolating 
cytotoxin ge . . . 


"Vfei 


0 . 0 


gb|U63277|HPU63277 


Helicobacter pyiori vacuolating 
cytotoxin ge . . . 


75b " 


o . u 


gbjU532B0|HPU632B0 


Helicobacter pyiori vacuolating 
cytotoxin ge . . , 


755" 


0.0 


gb|U£3265|HPU5326i> 


Helicobacter pyiori vacuolating 
cytotoxin ge. . . 


74 7 


0.0 


gb|U6i^67 (HPU6J26V 


Helicobacter pyiori vacuolating 
cytotoxin ge . . . 


747 


-oro ■ 


gb|U632Bl|HPUS32Bl 


Helicobacter pyiori vacuolating 
cytotoxin ge. . . 


741 


0.0 


gb|U63261|HPU63251 


Helicobacter pyiori vacuolating 
cytotoxin ge . . . 


739 


0.0 


gb|U63274 |HPU63274 


Helicobacter pyiori vacuolating 
cytotoxin ge . . . 


739' 


0.0 


gb|U53285|HPU632B5 


Helicobacter pyiori vacuolating 
cytotoxin ge, . . 


"737"" 


0.0 


emb|AJ0094iO |HPAjy430 " 


Helicobacter pyiori vacA gene 
(partial) 


737 


0.0 


emt>|AjUU943S |HPAJ943B 


Helicobacter pyiori vacA gene 
(partial) , . . . 


730" 


0 . 0 


emb|AJ005439 |HPAJy439 


Helicobacter pyiori vacA gene 
(partial) , ... 


730 


0.0 


gb|US3271|KPUG3271 


Helicobacter pyiori vacuolating 
cytotoxin ge . . . 


724' 


0T0" 


emo |Aduoy4ie |hpaj9418 


Helicobacter pylori vacA gene 
(partial) , ... 


122 


0.0 


emb|AJ00iJ422|HPAJ9422 


Helicobacter pylori vacA gene 
(partial) , ... 


122 


0.0 


gb'|U63256|HPU63256 


Helicobacter pyiori vacuolating 
cytotoxin ge. . . 


716 


0.0 


'gb|U6326fc|HPU632S6 


Helicobacter pyiori vacuolating 
cytotoxin ge . . . 


716 


0.0 


emblAJUUi^U |HyAJ942U 


Helicobacter pyiori vacA gene 
(partial) , ... 


714 


0.0 ' 


emb|AJ0Uy4^4|HPAJ94i!4 ' " 


Helicobacter pyiori vacA gene 
(partial) , ... 


714 


0.0 


emb | AJ 0 ti y 4 J 1 1 H P A J 9 4 li 1 


Helicobacter pyiori vacA gene 
(partial) , . . . 


'714 


0.0 


gb|U63260|HPU6326TT" 


Helicobacter pyiori vacuolating 
cytotoxin ge. . . 


708 


0.0 


gb | U5327B | HPUS327B 


Helicobacter pyiori vacuolating 
cytotoxin ge. . . 


"70B 


o;"o 1 


emi>|AJUUy4iy |HPAJ9419 


Helicobacter pyiori vacA gene 
(partial) , ... 


706 


0.0 


emb|AJ00y42B |HPAJ942B 


Helicobacter pyiori vacA gene 
(partial) , , . . 


706 


0.0 


emb|AJ009437 JHPAJ9437 


Helicobacter pylori vacA gene 
(partial) , ... 


706 


■ o:o 


emD|AJ0Uy4^7 |HPAJ'9427 


Helicobacter pyiori vacA gene 
(partial), ... 


704 


0.0 


gb|U63257|HPU532S7 " ' 


^Helicobacter pyiori vacuolating 
cytotoxin ge . . . 


700 


0.0 


emb|AJ00y4w!J !H^AJ9423 


Helicobacter pyiori vacA gene 
(partial) , ... 


653 


0.0 
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Sequences producing significant alignments 






Database/ Accession No. 


Gene 


Score 

/hits) 


E 


" pmb ] A'J 0 2 I H PA J 5^"3 2 


(partial) , ... 




0.0 


"emb 1 AJOTTg^ TV i HP AJ 54X7 ' 


Helicobacter* d'vj ori vacA aene 
(partial) , . . . 




0.0 


" ' emnTAJTJD' 9^ 2 1 1 HP AJ 9 4 71" 


Heii coba.cc.eir pyxon vacA gsne 
(partial) , 


bb i y 


0.0 


embl AJ009425 IHPAJ9426 


Helicobacter bvidri vacA aene 
(partial) , ... 


68B 


0.0 


" pmb" I AJO'D 9"4 3 5"! 1 B " 

dll4J i v u «j u i xir^ruJ ^ ^ *j u 


iiw *l jl wwu w x w y x> wx ^ v a wiiw 

(partial) , ... 


t»BB 


0.0 


g , i5"lTT53264"IHPIT53255"' " 


Helicobacter pylori vacuolating 
cytotoxin ge . . . 


676 


0.0 


embl AJ009433 IHPAJ9433 


Helicobacter DViori x^acA aeri'ie 
(partial) , ... 


€66 


0."D 


emb"'l "AJO 0 94 2 S"l HPAJ9 "3T5 


Hp 1 1 c*oJnact~ pr nviori vacA afne 

i4wA A^rf w*^ W l» W A, u y ^ A, ^ v c% 3 ^ 

(partial) , ... 


"S5B 


0.0 


"§5 rXJS'3 , 25B" , IHPU5325iB " "' " 


Helicobacter pylori vacuolating 
cytotoxin ge . . . 


652 


0.0 


emb I AJ009442'! HPAJ944T" 1 " 


Hp I i rnharf pt nv j nr*i varA nAnp 

(partial) , ... 


V2'b~ 


e-177 


'embl'AJOO 944riTrraTgro 


Hp f i r'ntiacfPT* nv ion va r*A npnf 

(partial) , ... 


S2"6' 


e-177 


ab IUB005B lHPUB0"o'6"B 

H*^ f U w V V W U j 1^£^ W W w w W W 


Helicobactei* nvion strain lil3 
vacuolating . , . 


622 


e-175 


emETSlToTJ 9434 I HPAJ9434 — 


(partial) , ... 


61B 


e-174 


emb | AJ009441 ]HPAJ9441 


Helicobacter ovion vacA aene 
(partial) , ... 


6U3 


e-170 


aD 1 AF0^i5€> l6 1 H.FV(^P2 

1 V ^ ^ V **p W 1 AAA V W*^ A* 


Helicobacter DViori strain K34A 
vacuolating . . . 


599 


e-16B 


emh 1 AJO 0 9447 1 HPAT94T7"" ~" 


hp 1 t ponarfpf nv t nn varA cjp^tip 

(partial) , ... 


5B7 


e-165 


emb | AJOO'9440 | HPAJ9 , 4~4"0" 


Helicobacter pylori vacA gene 
(partial) , ... 


"563 ' 


e-15B 


emb I AJ009435 | HPAJ9436 


Helicobacter DViori vacA aene 
(partial) , ... 


55b 


e-155 


emb 1 AJ009443 1 HPAJ94"43 ' 


top 1 "i rohjscf pr nvioin vacA apnp 

(IW -X ^ w VA/C. w w J- J ^* W A. ^ V.GftW^% Wilt* 

(partial) , ... 


555 


e-lbb 


eiUD | AJL)0y44fa | HPAiJ944b 


Helicobacter pylori vacA gene 
(partial) , ... 


Sbb 


e-155 


go j UB 0067 | HPUB0U67 


Helicobacter oviori strain 1B4 
vacuolating . . , 


5b3 


e-155 


gb | AF04273 5 1 AF047735 


Helicobacter pylon o K^.z 
vacuolating cytot. * . 


55J 


e-155 


emb 1 AJ009445 IHPAJ9445 


Helicobacter oviori vacA aene 

"taAAwWi/BWJfWi ^Jr X V C.Wli- ^)C41C 

(partial) , ... 


547 


e-153 


~ab 1 AF03 E"5T3 9 [ AFO'35509 


vacuolatin. . . 


547 


e-153 


gb|AF042734 |At , 04"2734 


Helicobacter pylori jjii vacuolating 
cytoto. . . 


53 7 


"e-lbO 


gb|AF03b£>l2|AF035612 


Helicobacter pylori strain K2 6A 
vacuolatin . . . 


347 


9e-93 


gb|AF035613|AF0356I3 


Helicobacter pylori strain K40A 
vacuolatin. . . 


323 


le-B5 


embjAJOO^ybV |H U ¥6S67 


Helicobacter pylori vacA gene, 
strain Mzi9 . . . 


317 


Be-B4 


gb|AF035515|HPVCPl 


Helicobacter pyiori strain RJ4A 
vacuolating . . . 


315 


^'e-B3 


gb|AF03bfcl4|At'03$614 


Helicobacter pylori strain Kb oa 
vacuolatin. . . 


307 


Be- 81 
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Sequences producing significant alignments 






Database/ Accession No. 


Gene 


Score 


E 

Value 


-rfVrtTTm H7"R rTTDTmT'^TTT" 


gene , pa . . . 




4e-21 


' aETU9157S'tHPTT9T57S" 

«3 W J w «r X .J / J | A 1 XT W ^ X -J / m> 


HeiicoDacter Dvior.3 strain K7 y vacA 
gene , pa . . * 


109 


4e-21 


| U ^ .A J « J f iA.T U J pi. J / J 


He* ) i r*f"jna ft - ef nv i on strain KHd varA 

gene , pa . . . 


-107' ■ 


"le-20 




up j i nrsna ft* &r nv t nri T*t* i a I varii 
xx ^4X1' kj±jc\ o o w x c J £JC*4. l*.lcl-i- vaun 

gene, stra. . . 


101 


ye-iy 


gb|U91577 (HPU91577 


Helicobacter pylori strain f^4 vacA 
gene, pa. . . 


100 


3e-l6 


gb|U91576 (HFU91576 


Helicobacter pylori strain k~/± vacA 
gene, pa. . . 


1U0 


Se-IB 


gb|AK03b610|AF0356lO 


Helicobacter pylori strain RliA 
vacuolatin. . . 


94 


2e-16 


gb|U915H0|HPUyi5E30 


Helicobacter pylori strain tou vacA 
gene, pa. . , 


92 


be-ib 


gblAPDSSfelllAlf'OSbGll 


Helicobacter pylori strain KbyA 
vacuolatin, . , 


92 


Be-16 


emb |' 714744 | HP VACA4 9 


Helicobacter pylori partial vacA 
gene, stra. . . 


H8 
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[0201] Table 8 shows the results of a BLAST search using the H. pylori CagA gene to identify homologous genes 
that can be used in a family shuffling format to obtain improved antigens. Homologous antigens have been cloned and 
sequenced from a number of related yet distinct H, pylori strains and additional natural diversity can be obtained by 
cloning antigen genes from other strains. These genes and others or fragments thereof are be cloned by methods such 
as PCR, shuffled and screened for improved antigens. 



Table 8 



Sequences producing significant alignments 






Database/Accession No. 


Gene 


Score 
(bits) 


E 

Value 


gb|AF083352|AF083352 


Helicobacter pylori cytotoxin associated p... 


7041 


0.0 


gb|AE000569|HPAE000569 


Helicobacter pylori section 47 of 134 of... 


5501 


0.0 


gb|Ll 1 714jHECMAJANT 


Helicobacter pylori major antigen gene 
sequ... 


4976 


0.0 


emb|X70039|HPCAI 


H.pylori cai gene for cytotoxicity associated 


4294 


0.0 


gb|U60176|HPU60176 


Helicobacter pylori cag pathogenicity 
island... 


4294 


0.0 


dbj|AB003397|AB003397 


Helicobacter pylori DNA for CagA, 
complet... 


4274 


0.0 


gb|U80066|HPU80066 


Helicobacter pylori strain 213, cytotoxin- 
as... 


349 


2e-93 
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Sequences producing significant alignments 






Database/Accession No. 


Gene 


Score 
(bits) 


E 

Value 


gb|U80065|HPU80065 


Helicobacter pylori strain 1 84, cytotoxin- 
as... 


343 


le-91 


gb|AF043488|AF043488 


Helicobacter pylori JK252 cytotoxicity 
ass... 


178 


4e-42 


gb|AF043487|AF043487 


Helicobacter pylori JK25 cytotoxicity 
asso... 


170 


le-39 


gb|AF043489|AF043489 


Helicobacter pylori JK269 cytotoxicity 
ass... 


163 


2e-37 


emb|X70038|HPCAIDUP 


H.pylori DNA duplication sequence within 
th... 


159 


4e-36 


gb|AF043490|AF043490 


Helicobacter pylori JK22 cytotoxicity 
asso... 


153 


2e-34 



Example 5 

Development Of Broad-Spectrum Vaccines Against Malaria 

[0202] This Example describes the use of DNA shuffling to generate improved vaccines against malaria infection. 
An excellent target for evolution by DNA shuffling is the Plasmodium falciparum merozoite surface protein, MSP 1 (Hui 
et al. (1 996) Infect Immun. 64: 1 502-1 509). MSP 1 is expressed on the surface of merozoites as an integral membrane 
protein. It is cleaved by parasite proteases just before and concomitant with rupture and release from infected cells. 
The cleavage appears to be obligatory for full function in MSP 1 binding to RBC receptors. The cleaved fragments 
remain attached to the membrane of the merozoite. Other membrane proteins on merozoites also participate in the 
attachment and specific invasion events. MSP1 is a proven candidate for inclusion in a vaccine against the asexual 
blood stage of malaria. 

[0203] The genes encoding MSP I can be isolated from various isolates of Plasmodium falciparum merozoites by 
PCR technology. Related naturally existing genes can be additionally used to increase the diversity of the starting 
genes. A library of shuffled MSP1 genes is generated by DNA shuffling, and this library is screened for induction of 
efficient immune responses. 

[0204] The screening can be done by injecting individual variants into test animals, such as mice or monkeys. Either 
purified recombinant proteins, or DNA vaccines or viral vectors encoding the relevant genes are injected. Typically, a 
booster injection is given 2-3 weeks after the first injection. Thereafter, the sera of the test animals are collected and 
these sera are analyzed for the presence of antibodies that reduce invasion of merozoites into uninfected erythrocytes 
(RBC). RBC are infected by the merozoite, immediately inside the RBC, the merozoite differentiates into a ring and 
this matures to a schizont that contains several nascent daughter merozoites, which then burst out of the infected cell, 
destroying it, and go on to attach and invade another RBC. In vivo, the merozoite is likely only extracellular for seconds. 
In vitro, any blockade of this event can dramatically reduce the level of reinfection. Antibodies against MSP 1 bind to 
the surface of merozoites that are released from schizont infected RBC when they rupture and thereby reduce the 
ability of these merozoites to attach and engage cognate RBC receptors on the uninfected RBC surface. Merozoite 
attachment is reduced, merozoite entry into new RBC is reduced, and the numbers of newly invaded cells detected at 
the early ring stage is therefore reduced if the culture is examined several hours after the blockade of invasion test. In 
some assay formats a surrogate of merozoite invasion inhibition is to note the appearance of agglutinated merozoites, 
although this is an indirect measure of antibodies that cause reduced invasion. 

[0205] The shuffled antigens that induce the most potent antibody responses reducing invasion of merozoites into 
uninfected erythrocytes are selected for further testing and can be subjected to new rounds of shuffling and selection. 
In subsequent studies, the capacity of these antigens to induce antibodies in man is investigated. Again, either purified 
recombinant antigens, or DNA vaccines or viral vectors encoding the relevant genes are injected and the protective 
immune responses are analyzed. 
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Example 6 

Development Of Broad-Spectrum Vaccines Against Viral Pathogens 

5 [0206] This Example describes the use of DNA shuffling to obtain vaccines that can induce an immune response 
against multiple isolates of viral pathogens. 

A. Venezuelan equine encephalitis virus (VEE) 

10 [0207] VEE belongs to the alphavirus genus, which are generally transmitted by mosquitoes. However, VEE is an 
unusual alphavirus in that it is also highly infectious by aerosol inhalation for both humans and rodents. The disease 
manifestations in humans range from subclinical or mild febrile disease to serious infection and inflammation of the 
central nervous system. Virus clearance coincides that of production of specific anti-VEE antibodies, which are believed 
to be the primary mediators of protective immune responses (Schmaljohn et al. (1982) Nature 297: 70). VEE is an 

15 unusual virus also because its primary target outside the central nervous system is the lymphoid tissue, and therefore, 
replication defective variants may provide means to target vaccines or pharmaceutically useful proteins to the immune 
system. 

[0208] At least seven subtypes of VEE are known that can be identified genetically and serologically. Based on 
epidemiological data, the virus isolates fall into two main categories: l-AB and l-C strains, which are associated with 
20 VEE epizootics/epidemics, and the remaining serotypes, which are associated primarily with enzootic vertebrate-mos- 
quito cycles and circulate in specific ecological zones (Johnston and Peters, In Fields Virology, Third Edition, eds. B. 
N. Fields etal., Lippincott-Raven Publishers, Philadelphia, 1996). 

[0209] The envelope protein (E) appears to be the major antigen in inducing neutralizing Abs. Accordingly, DNA 
shuffling is used to obtain a library of recombinant E proteins by shuffling the corresponding genes derived from various 
25 strains of VEE. These libraries and individuals chimeras/mutants thereof are subsequently screened for their capacity 
to induce widely cross-reacting and protective Ab responses. 

B. Flaviviruses 

30 [0210] Japanese encephalitis virus (JE), Tick-borne encephalitis virus (TBE) and Dengue virus are arthropod-borne 
viruses belonging to the Flavivirus family, which comprises 69 related viruses. The heterogeneity of the viruses within 
the family is a major challenge for vaccine development. For example, there are four major serotypes of Dengue virus, 
and a tetravalent vaccine that induces neutralizing Abs against all four serotypes is necessary. Moreover, non-neutral- 
izing antibodies induced by infection or vaccination by one Dengue virus may cause enhancement of the disease during 

35 a subsequent infection by another serotype. Therefore, cross-protective, broad spectrum vaccines for TBE and JE 
would provide significant improvements to the existing vaccines. In this Example, the ability of DNA shuffling to efficiently 
generate chimeric and mutated genes is used to generate cross-protective vaccines. 

1. Japanese encephalitis virus 

40 

[0211] Japanese encephalitis virus (JE) is a prototype of the JE antigenic complex, which comprises St. Louis en- 
cephalitis virus, Murray Valley encephalitis virus, Kunjin virus and West Nile virus (Monath and Heinz, In Fields Virology, 
Third Edition, eds. B.N. Fields etal., Lippincott-Raven Publishers, Philadelphia, pp 961-1034, 1996). Infections caused 
by JE are relatively rare, but the case-fatality is 5-40% because no specific treatment is available. J E is widely distributed 
45 in China, Japan, Philippines, far-eastern Russia and India providing a significant threat to those traveling in these 
areas. Currently available JE vaccine is produced from brain tissues of mice infected with single virus isolate. Side 
effects are observed in 1 0% to 30% of the vaccinees. 

[0212] To obtain chimeric and/or mutated antigens that provide a protective immune response against all or most of 
the viruses within the JE complex, DNA shuffling is performed on viral envelope genes. The amino acid identity within 

50 the JE complex varies between 72% and 93%. In addition, significant antigenic variation has been observed among 
JE strains by neutralization assays, agar gel diffusion, antibody absorption and monoclonal antibody analysis (Oda 
(1976) Kobe J. Med. Sci. 22: 123; Kobyashi etal. (1 984) Infect. Immun. 44: 117). Moreover, the amino acid divergence 
of the envelope protein gene among 1 3 strains from different Asian countries is as much as 4.2% (Ni and Barrett (1 995) 
J. Gen. Virol. 76: 401). The resulting library of recombinant polypeptides encoded by the shuffled genes is screened 

55 to identify those that provide a cross-protective immune response. 
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2. Tick-borne encephalitis virus 

[0213] The tick-borne encephalitis virus complex comprises 14 antigenically related viruses, eight of which cause 
human disease, including Powassan, Louping ill and Tick-borne encephalitis virus (TBE) (Monath and Heinz, In Fields 

5 Virology, Third Edition, eds. B.N. Fields et al., Lippincott-Raven Publishers, Philadelphia, pp. 961-1034, 1996). TBE 
has been recognized in all Central and Eastern European countries, Scandinavia and Russia, whereas Powassan 
occurs in Russia, Canada and the United States. The symptoms vary from flu-like illness to severe meningitis, menin- 
goencephalitis and meningoencephalitis with a fatality rate of 1% to 2% (Gresikova and Calisher, In Monath ed., The 
arboviruses: ecology and epidemiology, vol. IV, Boca Raton, FL, CRC Press, pp. 177-203, 1988). 

10 [0214] Family DNA shuffling is used to generate chimeric envelope proteins derived from the TBE complex to gen- 
erate crossprotective antigens. The envelope proteins within the family are 77-96% homologous, and viruses can be 
distinguished by specific mAbs (Holzmann etal, Vaccine, 10, 345, 1992). The envelope protein of Powassan is 78% 
identical at the amino acid level with that of TBE, and cross-protection is unlikely, although epidemiological data is 
limited. 

15 [0215] Langat virus is used as a model system to analyze protective immune responses in vivo (lacono-Connors et 
al. (1 996) Virus Res. 43: 1 25). Langat virus belongs to the TBE complex, and can be used in challenge studies in BSL3 
facilities. Serological studies based on recombinant envelope proteins are performed to identify immunogen variants 
that induce high levels of antibodies against envelope proteins derived from most or all viruses of the TBE complex. 

20 3. Dengue viruses 

[0216] Dengue viruses are transmitted though mosquito bites, posing a significant threat to troops and civilian pop- 
ulations particularly in tropical areas. There are four major serotypes of Dengue virus, namely Dengue 1, 2, 3 and 4. 
A tetravalent vaccine that induces neutralizing antibodies against all four strains of Dengue is required to avoid antibody- 

25 mediated enhancement of the disease when the individual encounters the virus of the other strain. 

[0217] The envelope protein of Dengue virus has been shown to provide an immune response that protects from a 
future challenge with the same strain of virus. However, the levels of neutralizing antibodies produced are relatively 
low and protection from live virus challenge is not always observed. For example, mice injected with genetic vaccines 
encoding envelope protein of Dengue-2 virus developed neutralizing antibodies when analyzed by in vitro neutralization 

30 assays, but the mice did not survive the challenge with live Dengue-2 virus (Kochel et al. (1 997) Vaccine 1 5: 547-552). 
However, protective immune responses were observed in mice immunized with recombinant vaccinia virus expressing 
Dengue 4 virus structural proteins (Bray et al. (1989) J. Virol. 63: 2853). These studies indicate that vaccinations with 
E proteins work, but significant improvements in the immunogenicity of the protective antigens are required. 
[0218] In this Example, DNA shuffling is performed on the genes encoding the envelope (E) protein from all four 

35 Dengue viruses and their antigenic variants. Family DNA shuffling is used to generate chimeric E protein variants that 
induce high titer neutralizing antibodies against all serotypes of Dengue. The E proteins of the different dengue viruses 
share 62% to 77% of their amino acids. Dengue 1 and Dengue 3 are most closely related (77% homologous), followed 
by Dengue 2 (69%) and Dengue 4 (62%). These homologies are well in the range that allows efficient family shuffling 
(Crameri et al. (1 998) Nature 391 : 288-291 ). 

40 [0219] The shuffled antigen sequences are incorporated into genetic vaccine vectors, the plasmids purified, and 
subsequently injected into mice. The sera are collected from the mice and analyzed for the presence of high levels of 
cross-reactive antibodies. The best antigens are selected for further studies using in vivo challenge models to screen 
for chimeras/mutants that induce cross-protection against all strains of Dengue. 

45 c. Improved Expression and Immunogenicity of Hantaan virus Glycoproteins 

[0220] One of the advantages of genetic vaccines is that vectors expressing pathogen antigens can be generated 
even when the given pathogen cannot be isolated in culture. An example of such potential situation was an outbreak 
of severe respiratory disease among rural residents of the Southwestern United States which was caused by a previ- 
50 ously unknown hantavirus, Sin Nombre virus (Hjelle et al. (1994) J. Virol. 68: 592). Much RNA sequence information 
of the virus was obtained well before the virus could be isolated and characterized in vitro. In these situations, genetic 
vaccines can provide means to generate efficient vaccines in a short period of time by creating vectors encoding 
antigens encoded by the pathogen. However, genetic vaccines can only work if these antigens can be properly ex- 
pressed in the host. 

55 [0221 ] Hantaan virus belongs to the Bunyavirus family. A characteristic feature of this family is that their glycoproteins 
typically accumulate at the membranes of the Golgi apparatus when expressed by cloned cDNAs, thereby reducing 
the efficacy of corresponding genetic vaccines (Matsuoka et al. (1991) Curr. Top. Microb. Immunol. 169: 161-179). 
Poor expression of Hantaan virus glycoproteins on the cell surface is also one explanation for poor immune responses 
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following injections of Hantaan virus genetic vaccines. 

[0222] In this Example, family DNA shuffling is used to generate recombinant Hantaan virus derived glycoproteins 
that are efficiently expressed in human cells and that can induce protective immune responses against the wild-type 
pathogen. Nucleic acids that encode the Hantaan virus glycoprotein are shuffled with genes that encode other homol- 
5 ogous Bunyavirus glycoproteins. The resulting library is screened to identify proteins that are readily expressed in 
human cells. The screening is performed using a dual marker expression vector that enables simultaneous analysis 
of transfection efficiency and expression of fusion proteins that are PIG-linked to the cell surface (Whitehorn et at. 
(1995) Biotechnology (N Y) 13:1215-9). 

[0223] Flow cytometry based cell sorting is used to select Hantaan virus glycoprotein variants that are efficiently 
10 expressed in mammalian cells. The corresponding sequences are then obtained by PCR or plasmid recovery. These 
chimeras/mutants are further analyzed for their capacity to protect wild mice against Hantaan virus infections. 

Example 7 

15 DNA Shuffling Of HSV-1 And HSV-2 Glycoproteins B And/Or D As Means To Induce Enhanced Protective 
Immune Responses 

[0224] This Example describes the use of DNA shuffling to obtain HSV glycoprotein B (gB) and glycoprotein D (gD) 
polypeptides that exhibit improved ability to induce protective immune responses upon administration to a mammal. 

20 Epidemiological studies have shown that prior infections with HSV-1 give partial protection against infections with HSV- 
2, indicating existence of cross-reactive immune responses. Based on previous vaccination studies, the main immu- 
nogenic glycoproteins in HSV appear to be gB and gD, which are encoded by 2.7 kb and 1.2 kb genes, respectively. 
The gB and gD genes of HSV-1 are about 85% identical to the corresponding gene of HSV-2, and the gB genes of 
each share little sequence identity with the gD genes. Baboon HSV-2 gB is appr. 75% identical to human HSV-1 or -2 

25 gB, with rather long stretches of almost 90% identity. In addition, 60-75% identity is found in portions of the genes of 
equine and bovine herpesviruses. 

[0225] Family shuffling is employed using as substrates nucleic acids that encode gB and/or gD from HSV-1 and 
HSV-2. Preferably, homologous genes are obtained from HSVs of various strains. An alignment of gD nucleotide se- 
quences from HSV-1 and two strains of HSV-2 is shown in Figure 7. Antigens encoded by the shuffled nucleic acids 
30 are expressed and analyzed in vivo. For example, one can screen for improved induction of neutralizing antibodies 
and/or CTL responses against HSV-1 /HSV-2. One can also detect protective immunity by challenging mice or guinea 
pigs with the viruses. Screening can be done using pools or individuals clones. 

Example 8 

35 

Evolution Of HIV Gp120 Proteins For Induction Of Broad Spectrum Neutralizing Ab Responses 

[0226] This Example describes the use of DNA shuffling to generate immunogens that crossreact among different 
strains of viruses, unlike the wild-type immunogens. Shuffling two kinds of envelope sequences can generate immu- 
40 nogens that induce neutralizing antibodies against a third strain. 

[0227] Antibody-mediated neutralization of HIV-1 is strictly type-specific. Although neutralizing activity broadens in 
infected individuals over time, induction of such antibodies by vaccination has been shown to be extremely difficult. 
Antibody-mediated protection from HIV-1 infection in vivo correlates with antibody-mediated neutralization of virus in 
vitro. 

45 [0228] Figure 8 illustrates the generation of libraries of shuffled gp120 genes. gp120 genes derived from HIV-1 DH12 
and H IV-I IIIB(NL43) are shuffled. The chimeric/mutant gp 120 genes are then analyzed for their capacity to induce 
antibodies that have broad spectrum capacity to neutralize different strains of HIV. Individual shuffled gp1 20 genes are 
incorporated into genetic vaccine vectors, which are then introduced to mice by injection or topical application onto the 
skin. These antigens can also be delivered as purified recombinant proteins. The immune responses are measured 

50 by analyzing the capacity of the mouse sera to neutralize HIV growth in vitro. Neutralization assays are performed 
against HIV-1 DH12, HIV-1 1MB and HIV-1 89.6. The chimeras/mutants that demonstrate broad spectrum neutralization 
are chosen for further rounds of shuffling and selection. Additional studies are performed in monkeys to illustrate the 
capacity of the shuffled gp120 genes to provide protection for subsequent infection with immunodeficiency virus. 

55 
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Example 9 

Antigen Shuttling of the Hepadnavirus Envelope Protein 

5 [0229] The Hepatitis B virus (HBV) is one of a member of a family of viruses called hepadnaviruses. This Example 
describes the use of genomes and individual genes from this family are used for DN A shuffling, which results in antigens 
having improved properties. 

A. Shuffling of Hepadnavirus envelope protein genes 

10 

[0230] The envelope protein of the HBV assembles to form particles that carry the antigenic structures collectively 
known as the Hepatitis B surface antigen (HBsAg; this term is also used to designate the protein itself). Antibodies to 
the major antigenic site, designated the "a" epitope (which is found in the envelope domain called S), are capable of 
neutralizing the virus. Immunization with the HBsAg-bearing protein thus serves as a vaccine against viral infection. 
15 The HBV envelope also contains other antigenic sites that can protect against viral infection and are potentially vital 
components of an improved vaccine. The epitopes are part of the envelope protein domains known as preS1 and 
preS2 (Figure 9). 

[0231] DNA shuffling of the envelope gene from several members of the hepadnavirus family is used to obtain more 
immunogenic proteins. Specifically, the genes from the following hepatitis viruses are shuffled: 

20 

the human HBV viruses, subtypes ayw and adw2 

a hepatitis virus isolated from chimpanzee 

25 . a hepatitis virus isolated from gibbon 

a hepatitis virus isolated from woodchuck 

[0232] If desired, genes from other genotypes of the human virus are available for inclusion in the DNA shuffling 
30 reactions. Likewise, other animal hepadnaviruses are available. 

[0233] To promote the efficiency of the formation of chimeras resulting from DNA shuffling, some artificial genes are 
made: 

In one case, a synthetic gene is made that contains the HBV envelope sequences, except for those codons which 
35 specify amino acids found in the chimpanzee and gibbon genes. For those codons, the chimpanzee or gibbon 

sequence is used. 

In a second case, a synthetic gene is synthesized in which the preS2 gene sequence from the human HBV adw2 
strain is fused with the woodchuck S region. 

40 

In a third case, all the oligonucleotides required to chemically synthesize each of the hepadnavirus envelope genes 
are mixed in approximately equal quantities and allowed to anneal to form a library of sequences. 

[0234] After DNA shuffling of the hepadnavirus envelope genes, either or both of two strategies are used to obtain 
45 improved HBsAg antigens. 

[0235] Strategy A: Antigens are screened by immunizing mice using two possible methods. The genes are injected 
in the form of DNA vaccines, i.e., shuffled envelope genes carried by a plasmid that comprises the genetic regulatory 
elements required for expression of the envelope proteins. Alternatively, the protein is prepared from the shuffled genes 
and used as the immunogen. 

so [0236] The sequences that give rise to greater immunogenicity for either the preS1-, preS2- or S-bome HBV antigens 
are selected for a second round of shuffling (Figure 1 0). For the second round, the best candidates are chosen based 
on their improved antigenicity and their other properties such as higher expression level or more efficient secretion. 
Screening and further rounds of shuffling are continued until a maximum optimization for one of the antigenic regions 
is obtained. 

55 [0237] The individually optimized genes are then used as a combination vaccine for the induction of optimal respons- 
es to preS1-, preS2- and S-bom epitopes. 

[0238] Strategy B: After isolation of the individually optimized genes as in Strategy A, the preS1 , preS2 and S can- 
didates are shuffled together, or in a pairwise fashion, in further rounds to obtain genes which encode proteins that 
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demonstrate improved immunogenicity for at least two regions containing HBsAg epitopes (Figure 11). 

B. Use of HBsAg to carry epitopes from unrelated antigens 

5 [0239] Several of the characteristics of the HBsAg make it a useful protein to carry epitopes drawn from other, un- 
related antigens. The epitopes can be either B epitopes (which induce antibodies) or T epitopes drawn from the class 
I type (which stimulate CD8+ T lymphocytes and induced cytotoxic cells) or class II type (which induce helper T lym- 
phocytes and are important in providing immunological memory responses. 

10 1. B cell epitopes 

[0240] Amino acid sequences of potential B epitopes are chosen from any pathogen. Such sequences are often 
known to induce antibodies, but the immunogenicity is weak or otherwise unsatisfactory for preparation of a vaccine. 
These sequences can also be mimotopes, which have been selected based on their ability to have a certain antigenicity 
15 or immunogenicity. 

[0241 ] The amino acid coding sequences are added to a hepadnavirus envelope gene. The heterologous sequences 
can either replace certain envelope sequences, or be added in addition to all the envelope sequences. The heterologous 
epitope sequences can be placed at any position in the envelope gene. A preferred position is the region of the envelope 
gene that encodes the major "a" epitope of the HBsAg (Figure 12). This region is likely to be exposed on the external 
20 side of the particles formed by the envelope protein, and thus will expose the heterologous epitopes. 

[0242] DNA shuffling is carried out on the envelope gene sequences, keeping the sequence of the heterologous 
epitopes constant. Screening is carried out to choose candidates that are secreted into the culture medium after trans- 
fection of plasmids from the shuffled library into cells in tissue culture. 

[0243] Clones that encode a secreted protein are then tested for immunogenicity in mice either as a DNA vaccine 
25 or as a protein antigen, as described above. Clones that give an improved induction of antibodies to the heterologous 
epitopes are chosen for further rounds of DNA shuffling. The process is continued until the immunogenicity of the 
heterologous epitope is sufficient for use as a vaccine against the pathogen from which the heterologous epitopes 
were derived. 

30 2. Class I epitopes 

[0244] MHC Class I epitopes are relatively short, linear peptide sequences that are generally between 6 and 12 
amino acids amino acids in length, most often 9 amino acids in length. These epitopes are processed by antigen- 
presenting cells either after synthesis of the epitope within the cell (usually as part of a larger protein) or after uptake 

35 of soluble protein by the cells. 

[0245] Polynucleotide sequences that encode one or more class I epitopes are inserted into the sequence of a 
hepadnavirus envelope gene either by replacing certain envelope sequences, or by inserting the epitope sequences 
into the envelope gene. This is typically done by modifying the gene before DNA shuffling or by including in the shuffling 
reaction certain oligonucleotide fragments that encode the heterologous epitopes as well as sufficient flanking hepad- 

40 navirus sequences to be incorporated into the shuffled products. 

[0246] Preferably, the heterologous class I epitopes are placed into different positions in the several hepadnavirus 
genes used for the DNA shuffling reaction. This will optimize the chances for finding chimeric gene carrying the epitopes 
in an optimal position for efficient presentation. 

45 3. Class II epitopes 

[0247] MHC Class II epitopes are generally required to be part of a protein which is taken up by antigen presenting 
cells, rather than synthesized within the cell. Preferably, such epitopes are incorporated into a carrier protein such as 
the HBV envelope that can be produced in a soluble form or which can be secreted if the gene is delivered in the form 
50 of a DNA vaccine. 

[0248] Polynucleotides that encode heterologous class II epitopes are inserted into regions of the hepadnavirus 
envelope genes that are not involved in the transmembrane structure of the protein. DNA shuffling is performed to 
obtain a secreted protein that also carries the class 1 1 epitopes. When injected as a protein, or when the gene is delivered 
as a DNA vaccine, the protein can be taken up by antigen presenting cells for processing of the class II epitopes. 

55 
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Example 10 

Evolution of broad spectrum vaccines against Hepatitis C Virus. 

5 [0249] Antigenic heterogeneity of different strains of Hepatitis C Virus (HCV) is a major problem in development of 
efficient vaccines against HCV. Antibodies or CTLs specific for one strain of HCV typically do not protects against other 
strains. Multivalent vaccine antigens that simultaneously protect against several strains of HCV would be of major 
importance when developing efficient vaccines against HCV. 

[0250] The HCV envelope genes, which encode envelope proteins E1 and E2, have been shown to induce both 

10 antibody and lymphoproliferative responses against these antigens (Lee et al. (1998) J. Virol. 72: 8430-6), and these 
responses can be optimized by DNA shuffling. The hypervariable region 1 (HVR1) of the envelope protein E2 of HCV 
is the most variable antigenic fragment in the whole viral genome and is primarily responsible for the large inter- and 
intra-individual heterogeneity of the infecting virus (Puntoriero etal. (1998) EMBOJ, 17: 3521-33). Therefore, the gene 
encoding E2 is a particularly useful target for evolution by DNA shuffling. 

15 [0251] DNA shuffling of HCV antigens, such as nucleocapsid or envelope proteins E1, E2, provides a means to 
generate multivalent HCV vaccines that simultaneously protect against several strains of HCV. These antigens are 
shuffled using the family DNA shuffling approach. The starting genes will be obtained from various natural isolates of 
HCV. In addition, related genes from other viruses can be used to increase the number of different recombinants that 
are generated. A library of related, chimeric variants of HCV antigens are then generated and this library will be screened 

20 for induction of widely crossreactive immune responses. The screening can be done directly in vivo by injecting indi- 
vidual variants into test animals, such as mice or monkeys. Either purified recombinant proteins or DNA vaccines 
encoding the relevant genes are injected. Typically, a booster injection is given 2-3 weeks after the first injection. 
Thereafter, the sera of the test animals are collected and these sera are tested for the presence of antibodies that react 
against multiple HCV virus isolates. 

25 [0252] Before the in vivo testing is initiated, the antigens can be pre-enriched in vitro for antigens that are recognized 
by polyclonal antisera derived from previously infected patients or test animals. Alternatively, monoclonal antibodies 
that are specific for various strains of HCV are used. The screening is performed using phage display or ELISA assays. 
For example, the antigen variants are expressed on bacteriophage M13 and the phage are then incubated on plates 
coated with antisera derived from patients or test animals infected with various HCV isolates. The phage that bind to 

30 the antibodies are then eluted and further analyzed in test animals for induction of crossreactive antibodies. 

Example 11 

Evolution of chimeric allergens that induce broad immune responses and have reduced risk of inducing 
35 anaphylactic reactions 

[0253] Specific immunotherapy of allergy is performed by injecting increasing amounts of the given allergens into 
the patients. The therapy typically alters the types of allergen-specific immune responses from a dominating T helper 
2 (T H 2) type response to a dominating T helper 1 (T H 1 ) type response. However, because allergic patients have 
40 increased levels of IgE antibodies specific for the allergens, the immunotherapy of allergy involves a riskof IgE receptor 
mediated anaphylactic reactions. 

[0254] T helper (T H ) cells are capable of producing a large number of different cytokines, and based on their cytokine 
synthesis pattern T H cells are divided into two subsets (Paul and Seder (1994) Ceil 76: 241-251). T H 1 cells produce 
high levels of IL-2 and IFN-gamma and no or minimal levels of IL-4, IL-5 and IL-13. In contrast, T H 2 cells produce high 

45 levels of IL-4, IL-5 and IL-13, whereas IL-2 and IFN-gamma production is minimal or absent. T H 1 cells activate mac- 
rophages, dendritic cells and augment the cytolytic activity of CD8+ cytotoxic T lymphocytes and NK cells (Id.), whereas 
T H 2 cells provide efficient help for B cells and they also mediate allergic responses due to the capacity of T H 2 cells to 
induce IgE isotype switching and differentiation of B cells into IgE secreting cell (De Vries and Punnonen (1996) In 
Cytokine regulation of humoral immunity: basic and clinical aspects. Eds. Snapper, CM., John Wiley & Sons, Ltd., 

50 West Sussex, UK, pp. 195-215 

[0255] This Example describes methods to generate chimeric allergens that can broadly modulate allergic immune 
responses. This can be achieved by DNA shuffling of related allergen genes to generate chimeric genes. In addition, 
chimeric/mutated allergens are less likely to be recognized by preexisting IgE antibodies of the patients. Importantly, 
allergen variants that are not recognized by IgE antibodies can be selected using patient sera and negative selection 

55 (Figure 13). 

[0256] As one example, chimeric allergen variants of Der p2, Der f2, Tyr p2 Lep d2 and Gly d2 allergens are gener- 
ated. These house dust mite allergens are very common in exacerbating allergic and asthmatic symptoms, and im- 
proved means to downregulate such allergic immune responses are desired. House dust mites can be used as sources 
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of the genes. The corresponding genes are shuffled using family DNA shuffling and a shuffled library is generated. 
Phage display is used to exclude allergens that are recognized by antibodies from allergic individuals. It is particularly 
important is to exclude variants that are recognized by IgE antibodies. Phage expressing the allergen variants are 
incubated with pools of sera derived from allergic individuals. The phage that are recognized by IgE antibodies are 

5 removed, and the remaining allergens are further tested in vitro and in wVo for their capacity to activate allergen-specific 
human T cells (Figure 1 4). Because immunotherapy of allergy is believed to function through induction of a dominating 
T H 1 response as compared to allergic T H 2 response, efficient T cell activation and induction of a T H 1 type response 
by allergen variants is used as a measure of the efficacy of the allergens to modulate allergic T cell responses. 
[0257] The optimal allergen variants are then further tested in vivo by studying skin responses after injections to the 

10 skin. A strong inflammatory response around the injection site is an indication of efficient T cell activation, and the 
allergen variants that induce the most efficient delayed type T cell response (typically observed 24 hours after the 
injection) are the best candidates for further studies in vivo to identify allergens that effectively downregulate allergic 
immune responses. Accordingly, these allergen variants re analyzed for their capacity to inhibit allergic responses in 
allergic, atopic and asthmatic individuals. The screening of allergen variants is further illustrated in Figure 1 3 and Figure 

15 14. 

Example 12 

Evolution of cancer antigens that induce efficient anti-tumor immune responses 

20 

[0258] Several cancer cells express antigens that are present at significantly higher levels on the malignant cells 
than on other cells in the body. Such antigens provide excellent targets for preventive cancer vaccines and immuno- 
therapy of cancer. The immunogenicity of such antigens can be improved by DNA shuffling. In addition, DNA shuffling 
provides means to improve expression levels of cancer antigens. 

25 [0259] This Example describes methods to generate cancer antigens that can efficiently induce anti-tumor immune 
responses by DNA shuffling of related cancer antigen genes. Libraries of shuffled melanoma-associated glycoprotein 
(gp1 00/pmel1 7) genes (Huang et ai. (1 998) J. invest. Dermatol, 111: 662-7) are generated. The genes can be isolated 
from melanoma cells obtained from various patients, who may have mutations of the gene, increasing the diversity in 
the starting genes. In addition, a gp100 gene can be isolated from other mammalian species to further increase the 

30 diversity of starting genes. A typical method for the isolation of the genes is RT-PCR. The corresponding genes are 
shuffled using single gene DNA shuffling or family DNA shuffling and a shuffled library is generated. 
[0260] The shuffled gp 100 variants, either pools or individual clones, are subsequently injected into test animals, 
and the immune responses are studied (Figure 15). The shuffled antigens are either expressed in E. cofi and recom- 
binant, purified proteins are injected, or the antigen genes are used as components of DNA vaccines. The immune 

35 response can be analyzed for example by measuring anti-gp100 antibodies, as previous studies indicate that the 
antigen can induce specific antibody responses (Huang et ai., supra.). Alternatively, the test animals that can be chal- 
lenged by malignant cells expressing gp100. Animals that have been efficiently immunized will generate cytotoxic T 
cells specific for gp100 and will survive the challenge, whereas in non-immunized or poorly immunized animals the 
malignant cells will efficiently grow eventually resulting in lethal expansion of the cells. Furthermore, antigens that 

40 induce cytotoxic T cells that have the capacity to kill cancer cells can be identified by measuring the capacity of T cells 
derived from immunized animals to kill cancer cells in vitro. Typically the cancer cells are first labeled with radioactive 
isotopes and the release of radioactivity is an indication of tumor cell killing after incubation in the presence of T cells 
from immunized animals. Such cytotoxicity assays are known in the art. 

[0261] The antigens that induce highest levels of specific antibodies and/or can protect against the highest number 
45 of malignant cells can be chosen for additional rounds of shuffling and screening. Mice are useful test animals because 
large numbers of antigens can be studied. However, monkeys are a preferred test animal, because the MHC molecules 
of monkeys are very similar to those of humans. 

[0262] To screen for antigens that have optimal capacity to activate antigen-specific T cells, peripheral blood mono- 
nuclear cells from previously infected or immunized humans individuals can be used. This is a particularly useful meth- 
50 od, because the MHC molecules that will present the antigenic peptides are human MHC molecules. Shuffled cancer 
antigens that induce cytotoxic T cells that have the capacity to kill cancer cells can be identified by measuring the 
capacity of T cells derived from immunized animals to kill cancer cells in vitro. Typically the cancer cells are first labeled 
with radioactive isotopes and the release of radioactivity is an indication of tumor cell killing after incubation in the 
presence of T cells from immunized animals. Such cytotoxicity assays are known in the art. 

55 
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Example 13 

Evolution of autoantigens that induce efficient immune responses 

5 [0263] Autoimmune diseases are characterized by an immune response directed against self antigens expressed 
by the host. Autoimmune responses are generally mediated by T H 1 cells that produce high levels of IL-2 and IFN- 
gamma. Vaccines that can direct autoantigen specific T cells towards T H 2 phenotype producing increased levels of 
IL-4 and IL-5 would be beneficial. For such vaccines to work, the vaccine antigens have to be able to efficiently activate 
specific T cells. DNA shuffling can be used to generate antigens that have such properties. To optimally induce T H 2 

10 cell differentiation it may be beneficial to coadminister cytokines that have been shown to enhance T H 2 cell activation 
and differentiation, such as IL-4 (Racke et al. (1994) J. Exp. Med. 180: 1961-66). 

[0264] This Example describes methods for generating autoantigens that can efficiently induce immune responses. 
DNA shuffling is performed on related autoantigen genes. For example, libraries of shuffled myelin basic proteins, or 
fragments thereof (Zamvil and Steinman (1990) Ann. Rev. Immunol. 8: 579-621); Brocke et al. (1996) Nature 379: 

15 343-46) are generated. MBP is considered to be an important autoantigen in patients with multiple sclerosis (MS). The 
genes encoding MBP from at least bovine, mouse, rat, guinea pig and human have been isolated providing an excellent 
starting point for family shuffling. A typical method for the isolation of the genes is RT-PCR. The shuffled MBP variants, 
either pools or individual clones, are subsequently injected into test animals, and the immune responses are studied. 
The shuffled antigens are either expressed in E. coll and recombinant, purified proteins are injected, or the antigen 

20 genes are used as components of DNA vaccines or viral vectors. The immune response can be analyzed for example 
by measuring anti-MBP antibodies by ELISA. Alternatively, the lymphocytes derived from immunized test animals are 
activated with MBP, and the T cell proliferation or cytokine synthesis is studies. A sensitive assays for cytokine synthesis 
is ELISPOT (McCutcheon et al. (1997) J. Immunol. Methods 210: 149-66). Mice are useful test animals because large 
numbers of antigens can be studied. However, monkeys are a preferred test animal, because the MHC molecules of 

25 monkeys are very similar to those of humans. 

[0265] To screen for antigens that have optimal capacity to activate MBP specific T cells peripheral blood mononu- 
clear cells from patients with MS can also be used. This is a particularly useful method, because the MHC molecules 
that will present the antigenic peptides are human MHC molecules. Shuffled antigens that activate MBP specific T cells 
can be identified by measuring the capacity of T cells derived from MS patients to proliferate or produce cytokines 

30 upon culture in the presence of the antigen variants. Such assays are known in the art. One such assay is ELISPOT 
(McCutcheon et al., supra.). An indication of the efficacy of an MBP variant to activate specific T cells is also the degree 
of skin inflammation when the antigen is injected into the skin of a patient with MS. Strong inflammation is correlated 
with strong activation of antigen-specific T cells. Improved activation of MBP specific T cells, particularly in the presence 
of IL-4, is likely to result in enhanced T H 2 cell responses, which are beneficial in the treatment of MS patients. 

35 

Example 14 

Method of Optimizing the Immunogenicity of Hepatitis B Surface Antigen 

40 [0266] This Example describes methods by which the envelope protein sequence of the hepatitis B virus can be 
evolved to provide a more immunogenic surface antigen. Such a protein is important for vaccination of low responders 
and for immunotherapy of chronic hepatitis B. 

Background 

45 

[0267] Current HBV vaccines (Merck, SKB) are based on the immunogenicity of the viral envelope protein and contain 
the Major (or Small) form of the envelope protein produced as particles in yeast. These particles induce antibodies to 
the major surface antigen (HBsAg) which can protect against infection when antibody levels are at least 1 0 milli-l nter- 
national Units per milliliter (mU/ml). These recombinant protein preparations are not capable of inducing humoral im- 

50 munity in chronic carriers (some 300 million cases worldwide) the induction of which would be important to control virus 
spread. Moreover, certain individuals respond poorly to the vaccine (up to 30-50% of vaccinees in some groups) and 
do not develop protective levels of antibody. The inclusion of the natural epitope sequences contained in the Middle 
or Large forms of the viral envelope protein has been used as a method to increase the immunogenicity of vaccine 
preparations. An alternative method is to introduce new (i.e., not present in the natural virus sequence) helper T-cell 

55 epitopes into the HBsAg sequence using DNA shuffling technology. 
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Method 

[0268] DNA sequences of HBsAg from different subtypes of HBV (e.g., ayw and adr) and the related woodchuck 
hepatitis virus are prepared for shuffling. Comparison of the genes encoding these proteins suggests that recombination 

5 would occur at least ten times within 850 base pairs when shuffling the ayw and woodchuck hepatitis virus (WHV) DNA 
sequences. Nucleotide and amino acid sequences of portions of different subtypes of HBV are shown in Figure 17. 
[0269] The sequence of the main HBsAg B-cell antigenic site (the "a" epitope) can be retained in the protein sequence 
by including the coding sequences of the external "a" loop in the final protein preparation. Peptide analogue(s) for the 
"a" epitope ofHBsAg have been described (Neurath etal. (1984) J. ViroL Methods 9:341 -346), and the immunogenicity 

10 of the "a" epitope has been demonstrated (Bhatnagar etal. (1982) Proc. Natl, Acad. Sci. USA 79: 4400-4404). HBsAg 
and WHsAg share the major "a" determinant, and chimps can be protected by both antigens (Cote etal. (1 986) J. Virol. 
60: 895-901). Likewise, important CTL epitopes can be included in the protein in a defined way. 
[0270] One can also easily introduce B or T (helper or CTL) epitopes from other antigens into the shuffled HBsAg 
sequence. This may focus the immune response to certain epitopes, independent of other potentially dominant epitopes 

15 from the same protein. Furthermore, the availability of the "a" loop on the HBsAg may provide a region of the envelope 
protein into which other artificial antigens or mimotopes could be included. 

[0271] In all cases where a novel HBV envelope sequence is prepared to include a specific epitope (from HBV, 
another pathogen or a tumor cell), shuffling of the surrounding sequences in the HBV envelope will serve to optimize 
expression of the protein and help to ensure that the immune response is directed to the desired epitope. 
20 [0272] Several methods of analyzing and utilizing shuffled HBsAg sequences are described below. 

A. Modulating expression levels of HBsAg 

[0273] Shuffled HBsAg sequences are introduced into cells in culture and the ability to direct expression of secreted 
25 HBsAg (measured with clinical kits for HBsAg expression) is evaluated. This can be used to identify shuffled HBsAg 
sequences which exhibit optimized HBsAg expression levels. Such coding sequences are particularly interesting for 
DNA vaccination. 

B. Circumventing low responsiveness to the HBsAg 

30 

[0274] Shuffled HBsAg sequences are evaluated for their ability to induce an immune response to the clinically 
relevant HBsAg epitopes. This can be done using mice of the H-2s and H-2f haplotypes, which respond poorly or not 
at all to HBsAg protein immunization. In these experiments, one can verify that antibodies are generated to the main 
"a" epitope in the S protein, and a second protective epitope in the PreS2 region (a linear sequence). 

35 [0275] The PreS2 and S coding sequences for the envelope protein (HBsAg) from the HBV ayw subtype (plasmid 
pCAG-M-Kan; Whalen) and the WHV (plasmid pWHV8 from ATCC) are amplified from the two plasmids by PCR and 
shuffled. Examples of suitable primers for PCR amplification are shown in Figure 1 8. The shuffled library of sequences 
is cloned into an HBsAg-expression vector and individual colonies are chosen for preparation of plasmid DNA. The 
DNA is administered to the test animals and vectors which induce the desired immune response are identified and 

40 recovered. 

C. Presentation of natural HBsAg CTL epitopes by evolved HBsAg proteins 

[0276] This example describes methods of using the evolved HBsAg protein to present natural HBsAg CTL epitopes. 
45 Shuffling is used to increase overall immunogenicity of the HBsAg protein, as discussed above. However, some of the 
evolved HBsAg sequences are replaced with class I or class II epitope sequences from the natural HBsAg protein in 
order to stimulate immunoreactivity specifically to these natural viral epitopes. Alternatively, the natural viral epitopes 
can be added to the evolved protein without loss of immunogenicity of the evolved HBsAg. 

50 d. Expression of tumor-devided CTL, epitopes by evolved HBsAg proteins 

[0277] This example describes methods of using the evolved HBsAg protein is used to express tumor-derived CTL 
epitopes. The overall immunogenicity of the HBsAg protein is increased by shuffling. However, some of the evolved 
HBsAg sequences are replaced with class I or class II epitope sequences from tumor cells in order to stimulate immu- 
55 noreactivity specifically to these natural viral epitopes. Alternatively, the tumor cells epitopes can be added to the 
evolved protein without loss of immunogenicity of the evolved HBsAg. 
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E. Expression of minotope sequences by the HBsAg 

[0278] This example describes the use of an evolved HBsAg protein for expression of mimotope sequences. Again, 
the evolved HBsAg protein is used to increase overall immunogenicity of the protein. However, some of the evolved 
5 HBsAg sequences are replaced with mimotope sequences to stimulate immunoreactivity specifically to the natural 
sequence which cross reacts with the mimotope. Alternatively, the mimotope sequences can be added to the evolved 
protein without loss of immunogenicity of the evolved HBsAg. 

Example 15 

10 

Fusion Proteins Of The HBsAg Polypeptide and HIV gp120 Protein 

[0279] This Example describes the preparation of fusion proteins ("chimeras") formed from the HBsAg polypeptide 
and the extracellular fragment gp120 of the HIV envelope protein, and their use as vaccines. 

15 

Background 

[0280] When used as a vaccine, recombinant monomeric gp120 has failed to induce antibodies that have strong 
neutralizing activity with primary isolates of the HIV virus. It has been suggested that oligomeric forms of the HIV 

20 envelope protein which expose certain regions of the tertiary structure would be better able to elicit virus-neutralizing 
antibodies (Parrin et al. (1997) Immunol. Lett. 57: 105-112; VanCott etal. (1997) J. Virol. 71 : 431 9-4330; 
[0281] In this Example, DNA shuffling is applied to this problem, in order to obtain gp 120 polypeptides which adopt 
conformations slightly different from those of previous preparations of recombinant gp 120. To allow the individual gp 
120 molecules to interact as oligomers, a fusion is prepared between gp120 sequences (on the N-terminus of the 

25 fusion) and HBsAg sequences (on the C-terminal of the fusion). 

[0282] The N-terminal peptide sequence of the S region of the HBsAg polypeptide is a transmembrane structure 
which is locked into the membrane of the endoplasmic reticulum. The actual N-terminus of the S region as well as the 
preS2 sequences are located in the lumenal part of the ER. They are found on the outside of the final HBsAg particles. 
By placing the gp120 sequences on the N-terminus of the HBsAg preS2 or S sequences, the gp120 sequences are 

30 also located on the outside of the particles. The gp1 20 molecules can thus be brought together in three-dimensional 
space to interact as in the virus. 

[0283] Since the exact conformation of the final chimera which will have the most appropriate immunogenicity cannot 
be predicted, DNA shuffling is employed. The sequences of the HBsAg polypeptide, which functions as a scaffold, and 
of gp120 are both shuffled. Screening of the shuffled products can be performed by ELISA assay using antibodies 
35 (polyclonal or monoclonal) which have previously been determined to have virus neutralizing activity. 

Method 

[0284] The sequences encoding the gp 120 fragment of the HIV envelope protein are preferably prepared as a 
40 synthetic gene to include codons which are optimal for gene expression in mammals. The gp1 20 sequence will typically 
include a signal sequence on its N-terminal end. 

[0285] The gp1 20 sequences are inserted into the preS2 region of an HBsAg-expressing plasmid. In the preS2 region 
of the plasmid pMKan and its derivatives, an EcoRI site and an Xho\ site are available for cloning. The gp1 20 sequences 
can be inserted between these two sites, which brings the gp1 20 closer to the start of the S coding sequences, or into 
45 the EcoRI site alone, which leaves a spacer sequence of about 50 amino acids between the gp1 20 sequence and the 
start of the S region of the HBsAg. These two different cloning strategies will give rise to chimeric molecules in which 
the gp120 sequences are located at different distances from the transmembrane region of the HBsAg sequence. This 
may be advantageous in allowing the gp 1 20 sequences to adopt conformations which are more suitable immunogens 
than monomeric gp120. ' 

so [0286] DNA shuffling of the entire chimeric sequence is carried out. Family shuffling is preferred; this involves the 
preparation of several gp120-HBsAg fusion proteins in which different gp120 and HBsAg (or WHV) sequences are 
used. An alignment of HBsAg nucleotide sequences is shown in Figure 1 9. After shuffling of the different sequences, 
the products are cloned into an expression vector such as pMKan. Pools of clones from the library of shuffled products 
are transfected into cultured cells and the secretion of chimeric proteins is assayed with broadly reactive antibodies to 

55 gp120. Positive clones can be further evaluated with particular antibodies that have demonstrated HIV neutralizing 
activity, for example the anti-CD4 binding domain recombinant human monoclonal antibody, lgGlb12 (Kessler et al. 
(1997) AIDS Res. Hum. Retroviruses. 1 : 13: 575-582; Roben etal. (1994) J. Virol. 68: 4821-4828). Candidate clones 
can then be used to immunize mice and the antiserum obtained is evaluated for HIV virus-neutralizing activity in in 
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vitro assays. 

[0287] Because the gp1 20 molecule (approx. 1 1 00 amino acids) is larger in size than the monomeric HBsAg preS2+S 
protein (282 amino acids), it is likely that not every HBsAg monomer in an aggregated particle will contain a gp120 
sequence. Internal initiation of protein synthesis can take place on the HBsAg coding sequences at the initiator me- 
5 thionine that marks the beginning of the S region. Thus, the chimeric molecule (which contains the gp120 sequences) 
will be mixed in the cell with the S region and the multimeric particles should assemble with an appropriate number of 
chimeric polypeptides and native HBsAg S monomers. Alternatively, an S-expressing plasmid can be mixed with the 
plasmid expressing the chimera, or a single plasmid which expresses the chimera and the S form can be constructed. 
A diagram of the resulting particles is shown in Figure 20. 

10 

Example 16 

DNA Shuffling Of HSV-1 And HSV-2 Glycoproteins B And/Or D As Means To Induce Enhanced Protective 
Immune Responses 

15 

[0288] This Example describes the use of DNA shuffling to obtain HSV glycoprotein B (gB) and glycoprotein D (gD) 
polypeptides that exhibit improved ability to induce protective immune responses upon administration to a mammal. 
Epidemiological studies have shown that prior infections with HSV-1 give partial protection against infections with HSV- 
2, indicating existence of cross-reactive immune responses. Based on previous vaccination studies, the main immu- 

20 nogenic glycoproteins in HSV appear to be gB and gD, which are encoded by 2.7 kb and 1 .2 kb genes, respectively. 
The gB and gD genes of HSV-1 are about 85% identical to the corresponding gene of HSV-2, and the gB genes of 
each share little sequence identity with the gD genes. Baboon HSV-2 gB is appr. 75% identical to human HSV-1 or -2 
gB, with rather long stretches of almost 90% identity. In addition, 60-75% identity is found in portions of the genes of 
equine and bovine herpesviruses. 

25 [0289] Family shuffling is employed using as substrates nucleic acids that encode gB and/or gD from HSV-1 and 
HSV-2. Preferably, homologous genes are obtained from HSVs of various strains. An alignment of gD nucleotide se- 
quences from HSV-1 and two strains of HSV-2 is shown in Figure 7. Antigens encoded by the shuffled nucleic acids 
are expressed and analyzed in vivo. For example, one can screen for improved induction of neutralizing antibodies 
and/or CTL responses against HSV-1 /HSV-2. One can also detect protective immunity by challenging mice or guinea 

30 pigs with the viruses. Screening can be done using pools or individuals clones. 

Example 17 

Evolution Of HIV Gp120 Proteins For Induction Of Broad Spectrum Neutralizing Ab Responses 

35 

[0290] This Example describes the use of DNA shuffling to generate immunogens that crossreact among different 
strains of viruses, unlike the wild-type immunogens. Shuffling two kinds of envelope sequences can generate immu- 
nogens that induce neutralizing antibodies against a third strain. 

[0291] Antibody-mediated neutralization of HIV-1 is strictly type-specific. Although neutralizing activity broadens in 
40 infected individuals over time, induction of such antibodies by vaccination has been shown to be extremely difficult. 
Antibody-mediated protection from HIV-1 infection in vivo correlates with antibody-mediated neutralization of virus in 
vitro. 

[0292] Figure 8 illustrates the generation of libraries of shuffled gp120 genes. gp120 genes derived from HIV-1 DH12 
and HIV-I II IB(NL43) are shuffled. The chimeric/mutant gp120 genes are then analyzed for their capacity to induce 

45 antibodies that have broad spectrum capacity to neutralize different strains of HIV. Individual shuffled gp1 20 genes are 
incorporated into genetic vaccine vectors, which are then introduced to mice by injection or topical application onto the 
skin. These antigens can also be delivered as purified recombinant proteins. The immune responses are measured 
by analyzing the capacity of the mouse sera to neutralize HIV growth in vitro. Neutralization assays are performed 
against HIV-1 DH12, HIV-IIIIB and HIV-1 89.6. The chimeras/mutants that demonstrate broad spectrum neutralization 

50 are chosen for further rounds of shuffling and selection. Additional studies are performed in monkeys to illustrate the 
capacity of the shuffled gp120 genes to provide protection for subsequent infection with immunodeficiency virus. 
[0293] It is understood that the examples and embodiments described herein are for illustrative purposes only and 
that various modifications or changes in light thereof will be suggested to persons skilled in the art and are to be included 
within the spirit and purview of this application and scope of the appended claims. All publications, patents, and patent 

55 applications cited herein are hereby incorporated by reference for all purposes. 
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SEQUENCE LISTING 

<110> Maxygen, inc. 

<120> Antigen Library immunization 

<130> 0157epl20 

<140> 04078487.8 
<141> 1999-02-10 

<150> US 09/021,769 
<151> 1998-02-11 

<150> US 60/074,294 
<151> 1998-02-11 

<150> US 60/105,509 
<151> 1998-10-23 

<160> 25 

<170> Patentln ver. 2.0 

<210> 1 
<211> 1185 
<212> DNA 

<213> Herpes Simplex virus 1 
<220> 

<223> glycoprotein D-l CgD-1) from Herpes Simplex Virus 
1 (HSV-l) 



<400> 1 
atggggggga 


ctgccgccag 


gttgggggcc 


gtgattttgt 


ttgtcgtcat 


agtgggcctc 


60 


catggggtcc 


gcggcaaata 


tgccttggcg 


gatgcctctc 


tcaagatggc 


cgaccccaat 


120 


cgctttcgcg 


gcaaagacct 


tccggtcctg 


gaccagctga 


ccgaccctcc 


gggggtccgg 


180 


cgcgtgtacc 


acatccaggc 


gggcctaccg 


gacccgttcc 


agccccccag 


cctcccgatc 


240 


acggtttact 


acgccgtgtt 


ggagcgcgcc 


tgccgcagcg 


tgctcctaaa 


cgcaccgtcg 


300 


gaggcccccc 


agattgtccg 


cggggcctcc 


gaagacgtcc 


ggaaacaacc 


ctacaacctg 


360 


accatcgctt 


ggtttcggat 


gggaggcaac 


tgtgctatcc 


ccatcacggt 


catggagtac 


420 


accgaatgct 


cctacaacaa 


gtctctgggg 


gcctgtccca 


tccgaacgca 


gccccgctgg 


480 


aactactatg 


acagcttcag 


cgccgtcagc 


gaggataacc 


tggggttcct 


gatgcacgcc 


540 


cccgcgtttg 


agaccgccgg 


cacgtacctg 


cggctcgtga 


agataaacga 


ctggacggag 


600 


attacacagt 


ttatcctgga 


gcaccgagcc 


aagggctcct 


gtaagtacgc 


cctcccgctg 


660 


cgcatccccc 


cgtcagcctg 


cctctccccc 


caggcctacc 


agcagggggt 


gacggtggac 


720 


agcatcggga 
ttgaagatcg 


tgctgccccg 
ccgggtggca 


cttcatcccc 
cgggcccaag 


gagaaccagc 
gccccataca 


gcaccgtcgc 
cgagcaccct 


cgtatacagc 
gctgcccccg 


780 
840 


gagctgtccg 


agacccccaa 


cgccacgcag 


ccagaactcg 


ccccggaaga 


ccccgaggat 


900 


tcggccctct 


tggaggaccc 


cgtggggacg 


gtggcgccgc 


aaatcccacc 


aaactggcac 


960 


atcccgtcga 


tccaggacgc 


cgcgacgcct 


taccatcccc 


cggccacccc 


gaacaacatg 


1020 


ggcctgatcg 


ccggcgcggt 


gggcggcagt 


ctcctggcag 


ccctggtcat 


ttgcggaatt 


1080 
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gtgtactgga tgcaccgccg cactcggaaa gccccaaagc gcatacgcct cccccacatc 1140 
cgggaagacg accagccgtc ctcgcaccag cccttgtttt actag 1185 

<210> 2 
<211> 1180 
<212> DNA 

<213> Herpes Simplex virus 2 
<220> 

<223> glycoprotein D-l (gD-1) from Herpes Simplex virus 
2 (HSV-2) 



<400> 2 
atggggcgtt 


tgacctccgg 


cgtcgggacg 


gcggccctgc 


tagttgtcgc 


ggtgggactc 


60 


cgcgtcgtct 


gcgccaaata 


cgccttagca 


gacccctcgc 


ttaagatggc 


cgatcccaat 


120 


cgatttcgcg 


ggaagaacct 


tccggttttg 


gaccagctga 


ccgacccccc 


cggggtgaag 


180 


cgtgtttacc 


acattcagcc 


gagcctggag 


gacccgttcc 


agccccccag 


catcccgatc 


240 


actgtgtact 


acgcagtgct 


ggaacgtgcc 


tgccgcagcg 


tgctcctaca 


tgccccatcg 


300 


gaggcccccc 


agatcgtgcg 


cggggcttcg 


gacgaggccc 


gaaagcacac 


gtacaacctg 


360 


accatcgcct 


ggtatcgcat 


gggagacaat 


tgcgctatcc 


ccatcacggt 


tatggaatac 


420 


accgagtgcc 


cctacaacaa 


gtcgttgggg 


gtctgcccca 


tccgaacgca 


gccccgctgg 


480 


agctactatg 


acagctttag 


cgccgtcagc 


gaggataacc 


tgggattcct 


gatgcacgcc 


540 


cccgccttcg 


agaccgcggg 


tacgtacctg 


cggctagtga 


agataaacga 


ctggacggag 


600 


atcacacaat 


ttatcctgga 


gcaccgggcc 


cgcgcctcct 


gcaagtacgc 


tctccccctg 


660 


cgcatccccc 


cggcagcgtg 


cctcacctcg 


aaggcctacc 


aacagggcgt 


gacggtcgac 


720 


agcatcggga 


tgttaccccg 


ctttactccc 


gaaaaccagc 


gcaccgtcgc 


cctatacagc 


780 


ttaaaaatcg 


ccgggtggca 


cggccccaag 


cccccgtaca 


ccagcaccct 


gctgccgccg 


840 


gagctgtccg 


acaccaccaa 


cgccacgcaa 


cccgaactcg 


ttccggaaga 


ccccgaggac 


900 


tcggccctct 


tagaggatcc 


cgccgggacg 


gtgtcttcgc 


agatcccccc 


aaactggcac 


960 


atcccgtcga 


tccaggacgt 


cgcgccgcac 


cacgcccccg 


ccgccccagc 


caacccgggc 


1020 


ctgatcatcg 


gcgcgctggc 


cggcagtacc 


ctggcggcgc 


tggtcatcgg 


cggtattgcg 


1080 


ttttgggtac 


gccgccggcg 


ctcagtggcc 


cccaagcgcc 


tacgtctccc 


ccacatccgg 


1140 


gatgacgacg 


cgcccccctc 


gcaccagcca 


ttgttttact 






1180 



<210> 3 
<211> 1242 
<212> DNA 

<213> Herpes Simplex virus 2 
<220> 

<223> glycoprotein D-2 CgD-2) from Herpes Simplex virus 
2 (HSV-2) 

<400> 3 

atggggcgtt tgacctccgg cgtcgggacg gcggccctgc tagttgtcgc ggtgggactc 60 
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cgcgtcgtct 


gcgccaaata 


cgccttagca 


gacccctcgc 


ttaagatggc 


cgatcccaat 


120 


cgatttcgcg 


ggaagaacct 


tccggttttg 


gaccagctga 


ccgacccccc 


cggggtgaag 


180 


cgtgtttacc 


acattcagcc 


gagcctggag 


gacccgttcc 


agccccccag 


catcccgatc 


240 


actgtgtact 


acgcagtgct 


ggaacgtgcc 


tgccgcagcg 


tgctcctaca 


tgccccatcg 


300 


gaggcccccc 


agatcgtgcg 


cggggcttcg 


gacgaggccc 


gaaagcacac 


gtacaacctg 


360 


accatcgcct 


ggtatcgcat 


gggagacaat 


tgcgctatcc 


ccatcacggt 


tatggaatac 


420 


accgagtgcc 


cctacaacaa 


gtcgttgggg 


gtctgcccca 


tccgaacgca 


gccccgctgg 


480 


agctactatg 


acagctttag 


cgccgtcagc 


gaggataacc 


tgggattcct 


catgcacgcc 


540 


cccgccttcg 


agaccgcggg 


tacgtacctg 


cggctagtga 


agataaacga 


ctggacggag 


600 


atcacacaat 


ttatcctgga 


gcaccgggcc 


cgcgcctcct 


gcaagtacgc 


tctccccctg 


660 


cgcatccccc 


cggcagcgtg 


cctcacctcg 


aaggcctacc 


aacagggcgt 


gacggtcgac 


720 


agcatcggga 


tgttaccccg 


ctttatcccc 


gaaaaccagc 


gcaccgtcgc 


cctatacagc 


780 


ttaaaaatcg 


ccgggtggca 


cggccccaag 


cccccgtaca 


ccagcaccct 


gctgccgccg 


840 


gagctgtccg 


acaccaccaa 


cgccacgcaa 


cccgaactcg 


ttccggaaga 


ccccgaggac 


900 


tcggccctct 


tagaggatcc 


cgccgggacg 


gtgtcttcgc 


agatcccccc 


aaactggcac 


960 


atcccgtcga 


tccaggacgt 


cgcgccgcac 


cacgcccccg 


ccgcccccag 


caacccgggc 


1020 


ctgatcatcg 


gcgcgctggc 


cggcagtacc 


ctggcggcgc 


tggtcatcgg 


cggtattgcg 


1080 


ttttgggtac 


gccgccgcgc 


tcagatggcc 


cccaagcgcc 


tacgtctccc 


ccacatccgg 


1140 


gatgacgacg 


cgcccccctc 


gcaccagcca 


ttgttttact 


agaggagttt 


ccccgttccc 


1200 


gtgtacctct 


gggcccgtgt 


gggagggtgg 


ccggggtatt 


tg 




1242 



<210> 4 
<211> 31 
<212> ON A 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence: primer 6025F 
<400> 4 

caagcttctc tatcaaagca gtaagtagta c 31 



<210> 5 
<2U> 25 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence:primer 7773R 
<400> 5 

cttcctgctg ctcccaagaa cccaa 25 



<210> 6 
<211> 25 
<212> DNA 
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<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: primer 6196F 

5 

<400> 6 

atagaaagag cagaagacag tggca 25 

<210> 7 
<211> 25 
10 <212> DNA 

<213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 7746R 

15 <400> 7 

aacaaagctc ctattcccac tgctc 25 

<210> 8 
<211> 32 
<212> DNA 
20 <213> Artificial Sequence 

<220> 

<223> Description of Artificial Sequence: primer 
BSSH2-6205F 

25 <400> 8 

ttggcgcgca gaagacagtg gcaatgagag tg 32 

<210> 9 
<211> 846 
<212> DNA 

<213> Hepatitis B virus 

<220> 
<221> CDS 
<222> (1). .(846) 

<223> PreS2-s coding region of hepatitis B virus adr 
surface antigen (HBsAg) 

<400> 9 

atg cag tgg aac tec aca aca ttc cac caa get ctg eta gac ccc aga 48 
Met Gin Trp Asn Ser Thr Thr Phe His Gin Ala Leu Leu Asp Pro Arg 
15 10 15 



30 



gtg agg ggc eta tac ttt cct get ggt ggc tec agt tec gga aca gta 
vaT Arg Gly Leu Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly Thr Val 
20 25 30 



96 



aac cct gtt ccg act act gee tea ccc ata teg tea ate ttc teg agg 144 

Asn Pro val Pro Thr Thr Ala Ser Pro He Ser Ser lie Phe Ser Arg 

35 40 45 

act ggg gac cct gca ccg aac atg gag aac aca aca tea gga ttc eta 192 

Thr Gly Asp Pro Ala Pro Asn Met Glu Asn Thr Thr ser Gly Phe Leu 

50 55 60 

gga ccc ctg etc gtg tta cag gcg ggg ttt ttc ttg ttg aca aga ate 240 

50 Gly Pro Leu Leu Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie 

65 70 75 80 

etc aca ata cca cag agt eta cac teg tgg tgg act tct etc aat ttt 288 

Leu Thr lie Pro Gin Ser Leu His Ser Trp Trp Thr ser Leu Asn Phe 

85 90 95 

55 
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eta gqg gca gca ccc acg tgt ctt gqc caa aat teg cag tec cca ace 
Leu Gly Ala Ala Pro Thr Cys Leu Gly Gin Asn Ser Gin ser Pro Thr 
100 105 110 

tec aat cac tea cca acc tct tgt cct cca att tgt cct gqt tat cgt 
ser Asn His ser Pro Thr Ser cys Pro Pro lie Cys Pro Gly Tyr Arg 
115 120 125 

tgg atg tgt ctg egg cgt ttt ate ata ttc etc ttc ate ctg ctg eta 
Trp Met Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu 
130 135 140 

tgc etc ate ttc ttg ttg gtt ctt ctg gac tac caa gqt atg ttg tct 
Cys Leu lie Phe Leu Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Ser 
145 150 155 160 

gtt tgt cct eta ctt cca aga aca tea act acc age acg gqa cca tgc 
val cys Pro Leu Leu pro Arg Thr ser Thr Thr ser Thr Gly Pro cys 
165 170 175 

aag acc tgc acg att cct get caa gqa acc tct atg ttt ccc tct tct 
Lys Thr cys Thr lie Pro Ala Gin Gly Thr Ser Met Phe Pro Ser ser 
180 185 190 

tgc tgt aca aaa cct teg gac gqa aac tgc act tgt att ccc ate cca 
Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro 
195 200 205 

tea tct tgg get ttc gca aga ttc eta tgg gag tgg gee tea gtc cgt 
Ser ser Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala ser val Arg 
210 215 220 

ttc tec tgg etc agt tta eta gtq cca ttt gtt cag tgg ttc gta gqg 
Phe Ser Trp Leu Ser Leu Leu val Pro Phe val Gin Trp Phe val Gly 
225 230 235 240 

ctt tec ccc act gtt tgg ctt tea gtt ata tgg atg atg tgg tat tgg 
Leu Ser Pro Thr val Trp Leu Ser val lie Trp Met Met Trp Tyr Trp 
245 250 255 

gqg cca agt ctg tac aac ate ttg agt ccc ttt tta cct eta tta cca 
Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu pro Leu Leu Pro 
260 265 270 

att ttc ttt tgt ctt tgg gta tac att tga 
lie Phe Phe Cys Leu Trp Val Tyr lie 
275 280 

<210> 10 
<211> 281 
<212> PRT 

<213> Hepatitis B virus 
<400> 10 

Met Gin Trp Asn ser Thr Thr Phe His Gin Ala Leu Leu Asp Pro Arg 
15 10 15 

val Arg Gly Leu Tyr Phe Pro Ala Gly Gly Ser Ser Ser Gly Thr val 
20 25 30 

Asn Pro Val Pro Thr Thr Ala ser Pro lie Ser Ser lie Phe Ser Arg 
35 40 45 

Thr Gly Asp Pro Ala Pro Asn Met Glu Asn Thr Thr Ser Gly Phe Leu 
50 55 60 
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Gly Pro Leu Leu Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie 
65 70 75 80 

Leu Thr lie Pro Gin Ser Leu His Ser Trp Trp Thr Ser Leu Asn Phe 
85 90 95 

Leu Gly Ala Ala pro Thr cys Leu Gly Gin Asn Ser Gin ser Pro Thr 
100 105 110 

Ser Asn His Ser Pro Thr Ser Cys Pro Pro lie Cys Pro Gly Tyr Arg 
115 120 125 

Trp Met Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu 
130 135 140 

Cys Leu lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Ser 
145 150 155 160 

val Cys Pro Leu Leu Pro Arg Thr ser Thr Thr Ser Thr Gly Pro Cys 
165 170 175 

Lys Thr Cys Thr lie Pro Ala Gin Gly Thr Ser Met Phe Pro Ser Ser 
180 185 190 

Cys Cys Thr Lys Pro Ser Asp Gly Asn Cys Thr Cys lie Pro lie Pro 
195 200 205 

ser ser Trp Ala Phe Ala Arg Phe Leu Trp Glu Trp Ala ser val Arg 
210 215 220 

Phe Ser Trp Leu Ser Leu Leu Val Pro Phe val Gin Trp Phe val Gly 
225 230 235 240 

Leu Ser Pro Thr Val Trp Leu Ser Val lie Trp Met Met Trp Tyr Trp 
245 250 255 

Gly Pro Ser Leu Tyr Asn lie Leu Ser Pro Phe Leu Pro Leu Leu Pro 
260 265 270 

lie Phe Phe cys Leu Trp val Tyr lie 
275 280 

<210> 11 
<211> 846 
<212> DNA 

<213> Hepatitis B virus 

<220> 

<221> CDS 

<222> CD . . C846) 

<223> Pres2-S coding region of hepatitis B virus ayw 
surface antigen (HBsAg) 

<400> 11 

atg cag tgg aat tec aca ace ttc cac caa act ctg caa gat ccc aga 

Met Gin Trp Asn ser Thr Thr phe His Gin Thr Leu Gin Asp Pro Arg 
15 10 15 

gtq aga gqe ctg tat ttc cct get gqt gqe tec agt tea gga aca gta 
val Arg Gly Leu Tyr Phe Pro Ala Gly Gly Ser Ser ser Gly Thr Val 
20 25 30 

aac cct gtt ctg act act gec tct ccc tta teg tea ate ttc teg agg 
Asn Pro Val Leu Thr Thr Ala Ser Pro Leu Ser ser lie Phe Ser Arg 
35 40 45 

att gqg gac cct gcg ctg aac atg gag aac ate aca tea gga ttc eta 
lie Gly Asp Pro Ala Leu Asn Met Glu Asn lie Thr ser Gly Phe Leu 
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50 55 60 

gqa ccc ctt etc gtq tta cag gcg ggg ttt ttc ttg ttg aca aga ate 
Gly Pro Leu Leu vat Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie 
65 70 75 80 

etc aca ata ccg cag agt eta gac teg tgg tgg act tct etc aat ttt 
Leu Thr lie Pro Gin Ser Leu Asp Ser Trp Trp Thr Ser Leu Asn Phe 
85 90 95 

eta ggg gga act ace gtq tgt ctt ggc caa aat teg cag tec cca acc 
Leu Gly Gly Thr Thr val cys Leu Gly Gin Asn ser Gin ser Pro Thr 
100 105 110 

tec aat cac tea cca acc tct tgt cct cca act tgt cct ggt tat cgc 
Ser Asn His Ser Pro Thr Ser Cys Pro Pro Thr Cys Pro Gly Tyr Arg 
115 120 125 

tgg atg tgt ctg egg cgt ttt ate ate ttc etc ttc ate ctg ctg eta 
Trp Met cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu 
130 13 5 140 

tgc etc ate ttc ttg ttg gtt ctt ctg gac tat caa ggt atg ttg ccc 
Cys Leu lie Phe Leu Leu Val Leu Leu Asp Tyr Gin Gly Met Leu Pro 
145 150 155 160 

gtt tgt cct eta att cca gqa tec tea aca acc age acg gga cca tgc 
val cys Pro Leu lie Pro Gly Ser ser Thr Thr ser Thr Gly Pro cys 
165 170 175 

egg acc tgc atg act act get caa gga acc tct atg tat ccc tec tgt 
Arg Thr Cys Met Thr Thr Ala Gin Gly Thr ser Met Tyr Pro ser cys 
180 185 190 

tgc tgt acc aaa cct teg gac gga aat tgc acc tgt att ccc ate cca 
cys Cys Thr Lys Pro Ser Asp Gly Asn cys Thr cys lie pro lie Pro 
195 200 205 

tea tec tgg get ttc gga aaa ttc eta tgg gag tgg gec tea gec cgt 
Ser Ser Trp Ala Phe Gly Lys Phe Leu Trp Glu Trp Ala Ser Ala Arg 
210 215 220 

ttc tec tgg etc agt tta eta gtg cca ttt gtt cag tgg ttc gta ggg 
phe ser Trp Leu ser Leu Leu val pro Phe val Gin Trp Phe val Gly 
225 230 235 240 

ctt tec ccc act gtt tgg ctt tea gtt ata tgg atg atg tgg tat tgg 
Leu Ser Pro Thr Val Trp Leu Ser val lie Trp Met Met Trp Tyr Trp 
245 250 255 

ggg cca agt ctg tac age ate ttg agt ccc ttt tta ccg ctg tta cca 
Giy Pro Ser Leu Tyr Ser lie Leu Ser Pro Phe Leu Pro Leu Leu Pro 
260 265 270 

att ttc ttt tgt ctt tgg gta tac att taa 
lie Phe Phe Cys Leu Trp Val Tyr lie 
275 280 

<210> 12 
<211> 281 
<212> PRT 

<213> Hepatitis B virus 
<400> 12 

Met Gin Trp Asn Ser Thr Thr Phe His Gin Thr Leu Gin Asp Pro Arg 
15 10 15 

val Arg Gly Leu Tyr Phe Pro Ala Gly Gly Ser Ser ser Gly Thr Val 
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20 25 30 

Asn Pro val Leu Thr Thr Ala Ser Pro Leu Ser ser lie Phe Ser Arg 
35 40 45 

lie Gly Asp Pro Ala Leu Asn Met Glu Asn lie Thr Ser Gly Phe Leu 
50 55 60 

Gly Pro Leu Leu Val Leu Gin Ala Gly Phe Phe Leu Leu Thr Arg lie 
65 70 75 80 

Leu Thr lie Pro Gin Ser Leu Asp ser Trp Trp Thr ser Leu Asn Phe 
85 90 95 

Leu Gly Gly Thr Thr Val Cys Leu Gly Gin Asn Ser Gin Ser Pro Thr 
100 105 110 

Ser Asn His Ser Pro Thr ser Cys Pro Pro Thr Cys Pro Gly Tyr Arg 
115 120 12 5 

Trp Met Cys Leu Arg Arg Phe lie lie Phe Leu Phe lie Leu Leu Leu 
130 135 140 

Cys Leu lie Phe Leu Leu val Leu Leu Asp Tyr Gin Gly Met Leu Pro 
145 150 155 160 

Val Cys Pro Leu lie Pro Gly Ser Ser Thr Thr Ser Thr Gly Pro Cys 
165 170 175 

Arg Thr Cys Met Thr Thr Ala Gin Gly Thr Ser Met Tyr Pro Ser Cys 
180 185 190 

Cys Cys Thr Lys Pro Ser Asp Gly Asn cys Thr cys lie Pro lie Pro 
195 200 205 

Ser ser Trp Ala Phe Gly Lys Phe Leu Trp Glu Trp Ala Ser Ala Arg 
210 215 220 

Phe Ser Trp Leu ser Leu Leu Val Pro Phe Val Gin Trp Phe Val Gly 
225 230 235 240 

Leu ser pro Thr val Trp Leu ser val lie Trp Met Met Trp Tyr Trp 
245 250 255 

Gly Pro Ser Leu Tyr Ser lie Leu Ser Pro Phe Leu Pro Leu Leu Pro 
260 265 270 

lie Phe Phe Cys Leu Trp Val Tyr lie 
275 280 

<210> 13 
<211> 40 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: forward hbv ayw 
pcr primer 

<400> 13 

ccgggaattc ctcgacacca tgcagtggaa ttccacaacc 

<210> 14 

<211> 39 

<212> DNA ✓ 

<213> Artificial Sequence 

<220> 
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<223> Description of Artificial Sequence: reverse HBV ayw 
PCR primer 

<400> 14 

ccggggtacc caaagacaaa agaaaattgg taacagcgg 



<210> 15 
<211> 849 
<212> DNA 

<213> woodchuck hepatitis B virus 

<220> 

<221> CDS 

<222> (1)..(849) 

<223> Woodchuck hepatitis B virus (WHV8) surface antigen 
<400> 15 

atg aaa aat cag act ttt cat etc cag gqg ttc gta gac gga tta cga 
Met Lys Asn Gin Thr Phe His Leu Gin Gly Phe val Asp Gly Leu Arg 
15 10 15 

gac ttg aca aca acg gaa cgc caa cac aat gec tat gga gat cct ttt 
Asp Leu Thr Thr Thr Glu Arg Gin His Asn Ala Tyr Gly Asp Pro Phe 
20 25 30 

aca aca eta age cct gcg gtt cct act gta tec ace ata ttg tct cct 
Thr Thr Leu Ser Pro Ala Val Pro Thr val Ser Thr lie Leu ser pro 
35 40 45 

ccc teg acg act gqg gac cct gca ctg tea ccg gag atg tea cca tea 
Pro Ser Thr Thr Gly Asp Pro Ala Leu Ser Pro Glu Met Ser Pro ser 
50 55 60 

agt etc eta gga etc etc gca gga tta cag gtg gtg tat ttc ttg tgg 
Ser Leu Leu Gly Leu Leu Ala Gly Leu Gin VaT Val Tyr Phe Leu Trp 
65 70 75 80 

aca aaa ate eta aca ata get cag aat eta gat tgg tgg tgg act tct 
Thr Lys lie Leu Thr lie Ala Gin Asn Leu Asp Trp Trp Trp Thr Ser 
85 90 95 

etc agt ttt cca gqg gqc ata cca gag tgc act gqc caa aat teg cag 
Leu ser Phe Pro Gly Gly lie Pro Glu cys Thr Giy Gin Asn ser Gin 
100 105 110 

ttc caa act tgc aaa cac ttg cca ace tec tgt cca cca act tgc aat 
Phe Gin Thr Cys Lys His Leu Pro Thr ser cys pro Pro Thr cys Asn 
115 120 125 

gqc ttt cgt tgg atg tat ctg egg cgt ttt ate ata tac eta tta gtc 
Gly Phe Arg Trp Met Tyr Leu Arg Arg Phe lie lie Tyr Leu Leu val 
130 135 140 

ctg ctg ctg tgc etc ate ttc ttg ttg gtt etc ctg gac tgg aaa ggt 
Leu Leu Leu cys Leu lie Phe Leu Leu val Leu Leu Asp Trp Lys Gly 
145 150 155 160 

tta ata cct gtc tgt cct ctt caa ccc aca aca gaa aca aca gtc aat 
Leu lie Pro val Cys Pro Leu Gin pro Thr Thr Glu Thr Thr val Asn 
165 170 175 

tgc aga caa tgc aca ate tct gca caa gac atg tat act cct cct tac 
Cys Arg Gin Cys Thr He Ser Ala Gin Asp Met Tyr Thr Pro Pro Tyr 
180 185 190 

tgt tgt tgt tta aaa cct acg gca gga aat tgc act tgt tgg ccc ate 
Cys Cys Cys Leu Lys Pro Thr Ala Gly Asn Cys Thr Cys Trp Pro lie 
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195 200 205 

cct tea tea tgg get tta gqa aat tac eta tgg gag tgg gee tta gee 
Pro ser Ser Trp Ala Leu Gly asp Tyr Leu Trp Glu Trp Ala Leu Ala 
210 215 220 

cgt ttc tct tgg etc aat tta eta gtg ccc ttg ctt caa tgg tta gqa 
Arg Phe ser Trp Leu Asn Leu Leu val Pro Leu Leu Gin Trp Leu Gly 
225 230 235 240 

gqa att tec etc att gcg tgg ttt ttg ctt ata tgg atg att tgg ttt 
Gly lie Ser Leu lie Ala Trp Phe Leu Leu lie Trp Met lie Trp Phe 
245 250 255 

tgg ggg ccc gca ctt ctg age ate tta ccg cca ttt att ccc ata ttt 
Trp Gly Pro Ala Leu Leu Ser lie Leu Pro Pro Phe lie Pro lie Phe 
260 265 270 

gtt ctg ttt ttc ttg att tgg gta tac att tga 
Val Leu Phe Phe Leu lie Trp val Tyr lie 
275 280 

<210> 16 
<211> 282 
<212> PRT 

<213> Woodchuck hepatitis B virus 
<400> 16 

Met Lys Asn Gin Thr Phe His Leu Gin Gly Phe Val Asp Gly Leu Arg 
15 10 15 

Asp Leu Thr Thr Thr Glu Arg Gin His Asn Ala Tyr Gly Asp Pro Phe 
20 25 30 

Thr Thr Leu Ser Pro Ala Val Pro Thr Val Ser Thr lie Leu Ser Pro 
35 40 45 

Pro Ser Thr Thr Gly Asp Pro Ala Leu Ser Pro Glu Met Ser Pro Ser 
50 55 60 

Ser Leu Leu Gly Leu Leu Ala Gly Leu Gin val val Tyr Phe Leu Trp 
65 70 75 80 

Thr Lys lie Leu Thr lie Ala Gin Asn Leu Asp Trp Trp Trp Thr Ser 
85 90 95 

Leu Ser Phe Pro Gly Gly lie Pro Glu Cys Thr Gly Gin Asn Ser Gin 
100 105 110 

Phe Gin Thr Cys Lys His Leu Pro Thr Ser Cys Pro Pro Thr Cys Asn 
115 120 125 

Gly Phe Arg Trp Met Tyr Leu Arg Arg Phe lie lie Tyr Leu Leu val 
130 135 140 

Leu Leu Leu Cys Leu lie Phe Leu Leu Val Leu Leu Asp Trp Lys Gly 
145 150 155 160 

Leu lie Pro Val Cys Pro Leu Gin Pro Thr Thr Glu Thr Thr val Asn 
165 170 175 

Cys Arg Gin Cys Thr lie Ser Ala Gin Asp Met Tyr Thr Pro Pro Tyr 
180 185 190 

Cys cys cys Leu Lys Pro Thr Ala Gly Asn Cys Thr Cys Trp Pro lie 
195 200 205 

Pro Ser Ser Trp Ala Leu Gly Asn Tyr Leu Trp Glu Trp Ala Leu Ala 



73 



EP 1 555 316 A2 



210 215 220 

Arg Phe Ser Trp Leu Asn Leu Leu Val Pro Leu Leu Gin Trp Leu Gly 
225 230 235 240 

Gly lie Ser Leu He Ala Trp Phe Leu Leu lie Trp Met lie Trp Phe 
245 250 255 

Trp Gly Pro Ala Leu Leu Ser lie Leu Pro Pro Phe lie Pro lie Phe 
260 265 270 

Val Leu Phe Phe Leu lie Trp val Tyr lie 
275 280 



<210> 17 
<211> 40 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial sequence: forward WHV PCR 
primer 

<400> 17 

ccgggaattc tcatctccag gggttcgtag acggattacg 



<210> 18 
<211> 39 

<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: reverse WHV 
PCR primer 

<400> 18 

ccggggtacc caaatcaaga aaaacagaac aaatatggg 
<210> 19 
<211> 20 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence :AYWSH FOR primer 
<400> 19 

gccggcagga aggaaatggg 



<210> 20 
<211> 23 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial sequence :AYWSHREV primer 
<400> 20 

ctgctattgt cttcccaatc etc 



<210> 21 
<211> 22 
<212> DNA 

<213> Artificial Sequence 
<220> 
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<223> Description of Artificial sequence: WHVSHFOR primer 
<400> 21 

cgggacatac cacgtggttt ag 



<210> 22 
<2U> 22 
<212> DNA 

<213> Artificial sequence 
<220> 

<223> Description of Artificial Sequence :WHVSHREV primer 
<400> 22 

ggcattaaag cagcgtatcc ac 



<210> 23 
<211> 16 
<212> PRT 

<213> Plasmodium berghei 
<220> 

<221> PEPTIDE 
<222> (1).. (16) 

<223> ci rcumsporozoite CCS) protein-based B-epitope 
<400> 23 

Pro Pro Pro Pro Asn Pro Asn Asp Pro Pro Pro Pro Asn Pro Asn Asp 

15 10 15 

<210> 24 
<211> 20 
<212> PRT 

<213> Plasmodium yoelii 
<220> 

<221> PEPTIDE 
<222> (1)..(20) 

<223> ci rcumsporozoite (CS) protein-based B-epitope 
<400> 24 

Gin Gly Pro Gly Ala Pro Gin Gly Pro Gly Ala Pro Gin Gly Pro Gly 
15 10 15 

Ala Pro Gin Gly 
20 



<210> 25 
<211> 12 
<212> PRT 

<213> Plasmodium berghei 

<220> 

<221> PEPTIDE 

<222> (1)..(12) 

<223> T-helper epitope 

<400> 25 

Lys Gin lie Arg Asp Ser lie Thr Glu Glu Trp ser 
15 10 
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Claims 

1 . An recombinant multivalent antigenic polypeptide that comprises a first antigenic determinant of a first polypeptide 
and at least a second antigenic determinant from a second polypeptide. 

2. The multivalent antigenic polypeptide of claim 1 , wherein the polypeptide comprises at least a third antigenic de- 
terminant from a third polypeptide. 

3. The multivalent antigenic polypeptide of claim 1 , wherein the first and second polypeptides are selected from the 
group consisting of cancer antigens, antigens associated with autoimmunity disorders, antigens associated with 
inflammatory conditions, antigens associated with allergic reactions, and antigens from infectious agents. 

4. The multivalent antigenic polypeptide of claim 3, wherein the antigens are from a virus, a parasite, or a bacteria. 

5. The multivalent antigenic polypeptide of claim 4, wherein the antigens are from a virus selected from the group 
consisting of a Venezuelan equine encephalitis virus or a related alphavirus, a virus of the Japanese encephalitis 
virus complex, a virus of the tick-borne encephalitis virus complex, a Dengue virus, a Hantavirus, an HIV, a hepatitis 
B virus, a hepatitis C virus, and a Herpes simplex virus. 

6. The multivalent antigenic polypeptide of claim 5, wherein the antigens are envelope proteins. 

7. The multivalent antigenic polypeptide of claim 4, wherein the antigens are from a bacteria and are selected from 
the group consisting of a Yersinia V antigen, a Staphylococcus aureus enterotoxin, a Streptococcus pyogenes 
enterotoxin, a Vibrio cholera toxin, an enterotoxigenic Escherichia coli heat labile enterotoxin, aOspA and a OspC 
polypeptide from a Borrelia species, an Antigen 85 polypeptide from a Mycobacterium species, a VacA and a CagA 
polypeptide from Helicobacter pylori, and an MSP antigen from Plasmodium falciparum, 

8. The multivalent antigenic polypeptide of claim 1, wherein the multivalent antigenic polypeptide exhibits reduced 
affinity to IgE from a mammal compared to the first or second polypeptides. 

9. The multivalent antigenic polypeptide of claim 1 , wherein the first antigenic determinant and the second antigenic 
determinant are from different serotypes of a pathogenic organism. 

10. The multivalent antigenic polypeptide of claim 1 , wherein the first antigenic determinant and the second antigenic 
determinant are from different species of pathogenic organism. 

11. The multivalent antigenic polypeptide of claim 1, wherein the first polypeptide and the second polypeptide are 
allergens. 

12. The multivalent antigenic polypeptide of claim 11, wherein the allergens are dust mite allergens, grass pollen 
allergens, birch pollen allergens, ragweed pollen allergens, hazel pollen allergens, cockroach allergens, rice al- 
lergens, olive tree pollen allergens, fungal allergens, mustard allergens, and bee venom. 

13. The multivalent antigenic polypeptide of claim 1, wherein the first polypeptide and the second polypeptide are 
associated with an inflammatory or autoimmune disease. 

14. The multivalent antigenic polypeptide of claim 13, wherein the first polypeptide and the second polypeptide are 
autoantigens associated with a disease selected from the group consisting of multiple sclerosis, scleroderma, 
systemic sclerosis, systemic lupus erythematosus, hepatic autoimmune disorder, skin autoimmune disorder, in- 
sulin-dependent diabetes mellitus, thyroid autoimmune disorder, and rheumatoid arthritis. 

15. The multivalent antigenic polypeptide of claim 1, wherein the first polypeptide and the second polypeptide are 
cancer antigens or sperm antigens. 

16. A recombinant antigen library comprising recombinant nucleic acids that encode antigenic polypeptides, wherein 
the library is obtained by recombining at least first and second forms of a nucleic acid which comprises a polynu- 
cleotide sequence that encodes a disease-associated antigenic polypeptide, wherein the first and second forms 
differ from each other in two or more nucleotides, to produce a library of recombinant nucleic acids. 
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17. The recombinant antigen library of claim 16, wherein the first and second polypeptides are toxins. 

18. A method of obtaining a polynucleotide that encodes a recombinant antigen having improved ability to induce an 
immune response to a disease condition, the method comprising: 

(1) recombining at least first and second forms of a nucleic acid which comprises a polynucleotide sequence 
that encodes an antigenic polypeptide that is associated with the disease condition, wherein the first and 
second forms differ from each other in two or more nucleotides, to produce a library of recombinant nucleic 
acids; and 

(2) screening the library to identify at least one optimized recombinant nucleic acid that encodes an optimized 
recombinant antigenic polypeptide that has improved ability to induce an immune response to the disease 
condition. 

19. The method of claim 1 8, wherein the method further comprises: 

(3) recombining at least one optimized recombinant nucleic acid with a further form of the nucleic acid, which 
is the same or different from the first and second forms, to produce a further library of recombinant nucleic acids; 

(4) screening the further library to identify at least one further optimized recombinant nucleic acid that encodes 
a polypeptide that has improved ability to induce an immune response to the disease condition; and 

(5) repeating (3) and (4), as necessary, until the further optimized recombinant nucleic acid encodes a polypep- 
tide that has improved ability to induce an immune response to the disease condition. 

20. The method of claim 18, wherein the disease-associated polypeptides are selected from the group consisting of 
cancer antigens, antigens associated with autoimmunity disorders, antigens associated with inflammatory condi- 
tions, antigens associated with allergic reactions, and antigens associated with infectious agents. 

21. The method of claim 18, wherein the disease condition is an infectious disease and the first and second forms of 
the nucleic acid each encode an antigen of a different serotype of a pathogenic agent. 

22. The method of claim 21 , wherein the first and second forms of the nucleic acid are each from a different species 
of pathogen. 

23. The method of claim 21 , wherein the screening is accomplished by: 

introducing into a test animal either: 

a) the library of recombinant nucleic acids, or 

b) recombinant polypeptides encoded by the library of recombinant nucleic acids; 
introducing the pathogenic agent into the test animal; and 

determining whether the test animal is resistant to challenge by the pathogenic agent. 

24. The method of claim 23, wherein the pathogenic agent introduced into the test animal is of a different serotype 
than that used as a source of the first and second forms of the nucleic acid. 

25. The method of claim 23, wherein the library is subdivided into a plurality of pools, each of which pools is introduced 
into a test animal to identify those pools that include an optimized recombinant nucleic acid that encodes a polypep- 
tide which has improved ability to induce an immune response to the pathogenic agent. 

26. The method of claim 25, wherein the pools that include an optimized recombinant nucleic acid are further subdivided 
into a plurality of subpools, each of which subpools is introduced into a test animal to identify those pools that 
include an optimized recombinant nucleic acid that encodes a polypeptide which has improved ability to induce 
an immune response to the pathogenic agent. 

27. The method of claim 1 8, wherein the optimized recombinant nucleic acid encodes a multivalent antigenic polypep- 
tide and the screening is accomplished by: 

expressing the library of recombinant nucleic acids in a phage display expression vector such that the recom- 
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binant antigen is expressed as a fusion protein with a phage polypeptide that is displayed on a phage particle 
surface; 

contacting the phage with a first antibody that is specific for a first serotype of the pathogenic agent and 
selecting those phage that bind to the first antibody; 

contacting those phage that bind to the first antibody with a second antibody that is specific for a second 
serotype of the pathogenic agent and selecting those phage that bind to the second antibody; 

wherein those phage that bind to the first antibody and the second antibody express a multivalent antigenic 
polypeptide. 

28. The method of claim 27, wherein the screening further comprises contacting those phage that bind to the first and 
second antibodies with one or more additional antibodies, each of which is specific for an additional serotype of 
the pathogenic agent, and selecting those phage that bind to the respective additional antibodies. 

29. The method of claim 27, wherein the phage display expression vector comprises a suppressible stop codon be- 
tween the recombinant nucleic acid and the phage polypeptide, whereby expression in a host cell which comprises 
a corresponding suppressor tRNA results in production of the fusion protein and expression in a host cell which 
lacks a corresponding suppressor tRNA results in production of the recombinant antigen not as a fusion protein. 

30. The method of claim 18, wherein the optimized recombinant antigen exhibits an enhanced expression level in a 
host cell and the screening is accomplished by expression of each recombinant nucleic acid in the host cell and 
subjecting the host cells to flow cytometry-based cell sorting to obtain those host cells that display the recombinant 
antigen on the host cell surface. 

31 . The method of claim 1 8, wherein the improved property is selected from the group consisting of: 

improved immunogenicity; 

enhanced cross-reactivity against different forms of the disease-associated antigenic polypeptide; 
reduced toxicity; 

improved adjuvant activity in vivo; and 

improved production of the immunogenic polypeptide. 

32. The method of claim 31 , wherein the improved property is enhanced cross-reactivity against different forms of the 
disease-associated polypeptide and the first and second forms of the nucleic acid are from a first and a second 
form of the disease-associated polypeptide. 

33. The method of claim 32, wherein the first and second forms of the disease-associated polypeptide are obtained 
from at least a first and second species of a pathogenic agent and the optimized recombinant nucleic acid encodes 
a recombinant polypeptide that induces a protective response against both species of the pathogenic agent. 

34. The method of claim 33, wherein the recombinant polypeptide induces a protective response against at least one 
additional species of the pathogenic agent. 

35. The method of claim 33, wherein the pathogenic agent is a toxin. 

36. The method of claim 33, wherein the pathogenic agent is a virus or a cell. 

37. The method of claim 33, wherein the disease-associated polypeptide is a Yersinia V-antigen. 

38. The method of claim 37, wherein the at least first and second forms of a nucleic acid are obtained from at least a 
first and second species of Yersinia. 

39. The method of claim 38, wherein the Yersinia species are selected from the group consisting of Y pestis, Y ente- 
rocoiitica, and Y pseudotuberculosis. 

40. The method of claim 33, wherein the pathogenic agent is a bacterial toxin. 

41. The method of claim 18, wherein the disease condition is cancer and the screening step involves introducing the 
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optimized recombinant nucleic acids into a genetic vaccine vector and testing library members for ability to inhibit 
proliferation of cancer cells or inducing death of cancer cells. 

42. The method of claim 41, wherein the optimized recombinant nucleic acid comprises a nucleotide sequence that 
encodes a tumor specific antigen. 

43. The method of claim 41, wherein the optimized recombinant nucleic acid comprises a nucleotide sequence that 
encodes a molecule which is capable of inhibiting proliferation of cancer cells. 

44. The method of claim 18, wherein the disease condition is an inflammatory response which has an unknown or no 
antigen specificity and the screening step involves one or more of the following: 

a) determining the ability of the genetic vaccine vector to induce cytokine production by PBMC, synovial fluid 
cells, purified T cells, monocytes/macrophages, dendritic cells, or T cell clones; 

b) determining the ability of the genetic vaccine vector to induce T cell activation or proliferation; and 

c) determining the ability of the genetic vaccine vector to induce T cell differentiation to T H 1 or T H 2 cells. 

45. The method of claim 18, wherein the disease condition is an autoimmune response. 

46. The method of claim 45, wherein the optimized recombinant antigenic polypeptide shifts the immune response 
from a T H 1 -mediated response to a T H 2-mediated response. 

47. The method of claim 18, wherein the disease condition is an allergic immune response. 

48. The method of claim 47, wherein the optimized recombinant antigenic polypeptide shifts the immune response 
from a T H 2-mediated response to a T H 1 -mediated response. 

49. The method of claim 47, wherein the optimized recombinant antigenic polypeptide induces an immune response 
characterized by predominant IgG and IgM expression and reduced IgE expression. 

50. The method of claim 47, wherein the optimized recombinant antigenic polypeptide is not recognized by pre-existing 
IgE molecules present in sera of atopic mammals. 

51. The method of claim 50, wherein the optimized recombinant antigenic polypeptide retains T cell epitopes that are 
involved in modulating a T cell response. 

52. A method of obtaining a recombinant viral vector which has an enhanced ability to induce an antiviral response in 
a cell, the method comprising the steps of: 

(1) recombining at least first and second forms of a nucleic acid which comprise a viral vector, wherein the 
first and second forms differ from each other in two or more nucleotides, to produce a library of recombinant 
viral vectors; 

(2) transfecting the library of recombinant viral vectors into a population of mammalian cells; 

(3) staining the cells for the presence of Mx protein; and 

(4) isolating recombinant viral vectors from cells which stain positive for Mx protein, wherein recombinant viral 
vectors from positive staining cells exhibit enhanced ability to induce an antiviral response. 

53. The method of claim 52, wherein the viral vector comprises an influenza viral genomic nucleic acid. 
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Figure 8A 



Expression of shuffled envelope gpl20 protein in a genetic vaccine 



rev 

□ 



gpl20 



vpu 



rev 



gp41 



Insert BssH2 site in place of ATG 
(ATGCAA to GGCGCGC) 



Make a silent mutation to introduce 
unique, blunt restriction site 



BssH2 



Blunt Sequences to be shuffled 



rev 
□ 



CMV - 



gpl20 



rev 



RRE 




Figure 8B 

PCR primers for genetic vaccine for HIV 

622 U the beginning of gp 120 7747, the end ofgp!20 



gp!20 



gp41 



^ 6025F caagcttctctatcaaagcagtaagtagtac ^ Primers to generate 

7773R CTTCCTGCTGCTCCCAAGAACCCAA shuffling materials 

6196F ATAGAAAGAGCAGAAGACAGTGGCA ^ Primes to am nlifv 

— 7746R AACAAAGCTCCTATTCCCACTGCTC mmers to ampilty 

shuffled materials 

BssH2 — ► ^ EcoR1 

Primers to clone the PCR 

BssH2-6205F ttggcgcGCAGAAGACAGTGGCAATGAGAGTG 

Ends next to the unique blunt site to be ibtroduced to fragment into genetic 
the vector vaccine vector 
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Figure 9 



Domain Structure of Hepadnavirus Envelope Genes 



preS1 preS2 



Figure 10 

Isolate genes coding for proteins in which the 
immunogenicity of one antigenic domain is 
improved: 
++++ 



preS1 


preS2 
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Figure 1 1 

Shuffle genes for which the immunogenicity 
of one antigenic domain is improved: 

++++ 



preS1 preS2 



++++ 



preS1 preS2 



++++ 



preS1 preS2 



++++ ++++ 



Second round of shuffling 



++++ 



preS1 preS2 



Figure 12 

Transmembrane organization of the HBsAg polypeptide 



Major « a » Epitope 
PreS2 Q 



COOH 
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Figure 13 



Phage Display for Allergens not 
Recognized by Pre-existing IgE 
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TTYYT 



X 1ft XXX 



J Remove variants that 
bind to IgE derived 
from atopic patients 



J Repeat until no 
binding is observed 
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Figure 14 



Screening of Allergen Variants that 
Efficiently Activate Th cells from Atopics 
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In vivo testing 
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Figure 15 



Screening of Cancer Antigens that 
Efficiently Activate T cells from Patients 
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Figure 16: Generation of vectors with multiple T cell epitopes by shuffling libraries of T 
cell epitopes 
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Figure 17: Sequences of PreS2-S Coding Regions of Different HBsAg (or WHsAg) Proteins 



M38454, HPBADR1CG Hepatitis B virus, complete genome. Shown is HBsAg coding 
region (SEQ ID NO: 10); PreS2 starts at 1811 of complete genome sequence, S starts at 
1976, end at 2656. 

ATGCAGTGGAACTCCACAACATTCCACCAAGCTCTGCTAGACCCCAGAGTGAGGGGCCTATA 
CTTTCCTGCTGGTGGCTCCAGTTCCGGAACAGTAAACCCTGTTCCGACTACTGCCTCACCCA 
TATCGTCAATCTTCTCGAGGACTGGGGACCCTGCACCGAACATGGAGAACACAACATCAGGA 
TTCCTAGGACCCCTGCTCGTGTTACAGGCGGGGTTTTTCTTGTTGACAAGAATCCTCACAAT 
ACCACAGAGTCTACACTCGTGGTGGACTTCTCTCAATTTTCTAGGGGCAGCACCCACGTGTC 
TTGGCCAAAATTCGCAGTCCCCAACCTCCAATCACTCACCAACCTCTTGTCCTCCAATTTGT 
CCTGGTTATCGTTGGATGTGTCTGCGGCGTTTTATCATATTCCTCTTCATCCTGCTGCTATG 
CCTCATCTTCTTGTTGGTTCTTCTGGACTACCAAGGTATGTTGTCTGTTTGTCCTCTACTTC 
CAAGAACATCAACTACCAGCACGGGACCATGCAAGACCTGCACGATTCCTGCTCAAGGAACC 
TCTATGTTTCCCTCTTCTTGCTGTACAAAACCTTCGGACGGAAACTGCACTTGTATTCCCAT 
CCCATCATCTTGGGCTTTCGCAAGATTCCTATGGGAGTGGGCCTCAGTCCGTTTCTCCTGGC 
TCAGTTTACTAGTGCCATTTGTTCAGTGGTTCGTAGGGCTTTCCCCCACTGTTTGGCTTTCA 
GTTATATGGATGATGTGGTATTGGGGGCCAAGTCTGTACAACATCTTGAGTCCCTTTTTACC 
TCTATTACCAATTTTCTTTTGTCTTTGGGTATACATTTGA 



HBsAg Protein adr (SEQ ID NO: 1 1): 

MQWNSTTFHQALLDPRVRGLYFPAGGSSSGTVNPVPTTASPISSIFSRTGDPAPNMENTTSG 
FLGPLLVLQAGFFLLTRILTIPQSLHSWWTSLNFLGAAPTCLGQNSQSPTSNHSPTSCPPIC 
PGYRWMCLRRFIIFLFILLLCLIFLLVLLDYQGMLSVCPLLPRTSTTSTGPCKTCTIPAQGT 
SMFPSSGCTKPSDGNCTCIPIPSSWAFARFLWEWASVRFSWLSLLVPFVQWFVGLSPTVWLS 
VIWMMWYWGPSLYNILSPFLPLLPIFFCLWVYIZ 
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Figure 17 (continued) 



J02203; HFBAYW Hepatitis B Virus ayw. (SEQ ID NO: 1 1) 

ATGCAGTGGAATTCCACAACCTTCCACCAAACTCTGCAAGATCCCAGAGTGAGAGGCCTGTA 
TTTCCCTGCTGGTGGCTCCAGTTCAGGAACAGTAAACCCTGTTCTGACTACTGCCTCTCCCT 
TATCGTCAATCTTCTCGAGGATTGGGGACCCTGCGCTGAACATGGAGAACATCACATCAGGA 
TTCCTAGGACCCCTTCTCGTGTTACAGGCGGGGTTTTTCTTGTTGACAAGAATCCTCACAAT 
ACCGCAGAGTCTAGACTCGTGGTGGACTTCTCTCAATTTTCTAGGGGGAACTACCGTGTGTC 
TTGGCCAAAATTCGCAGTCCCCAACCTCCAATCACTCACCAACCTCTTGTCCTCCAACTTGT 
CCTGGTTATCGCTGGATGTGTCTGCGGCGTTTTATCATCTTCCTCTTCATCCTGCTGCTATG 
CCTCATCTTCTTGTTGGTTCTTCTGGACTATCAAGGTATGTTGCCCGTTTGTCCTCTAATTC 
CAGGATCCTCAACAACCAGCACGGGACCATGCCGGACCTGCATGACTACTGCTCAAGGAACC 
TCTATGTATCCCTCCTGTTGCTGTACCAAACCTTCGGACGGAAATTGCACCTGTATTCCCAT 
CCCATCATCCTGGGCTTTCGGAAAATTCCTATGGGAGTGGGCCTCAGCCCGTTTCTCCTGGC 
TCAGTTTACTAGTGCCATTTGTTCAGTGGTTCGTAGGGCTTTCCCCCACTGTTTGGCTTTCA 
GTTATATGGATGATGTGGTATTGGGGGCCAAGTCTGTACAGCATCTTGAGTCCCTTTTTACC 
GCTGTTACCAATTTTCTTTTGTCTTTGGGTATACATTTAA 



Forward HBV avw PCR Primer (with 2 natural EcoRI sites) (SEQ ID NO: 12): 
5 ' -CCGGGAATT CCTCGACACCATGCAGTGGAATTCCACAACC-3 f 



Reverse HBV ayw PCR Primer (with added Kpnl site) (SEQ ID NO: 13): 
5 ' - CCGGGGTACCCAAAGACAAAAGAAAATTGGTAACAGCGG- 3 ' 



HBsAg Protein ayw (SEQ ID NO: 14): 

MQWNSTTFHQTLQDPRWGLYFPAGGSSSGTWPVLTTASPLSSIFSRIGDPAL1SIMENITSG 
FLGPLLVLQAGFFLLTRILTIPQSLDSWWTSIiNFLGGTTVCLGQNSQSPTSNHSPTSCPPTC 
PGYRWMCLRRFIIFLFILLLCLIFLLVLLDYQGMLPVCPLIPGSSTTSTGPCRTCMTTAQGT 
SMYPSCCCTKPSDGNCTCIPIPSSWAFGKFLWEWASARFSWLSLLVPFVQWFVGLSPTVWLS 
VIWMMWYWGPSLYSILSPFLPLLPIFFCLWVYIZ 
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Figure 1 7 (continued) 



J04514 OHVHEPBA Woodchuck hepatitis B virus genome (WHV8) (SEQ ID NO: 15) 
(PNAS 86, 1846-1849, 1989). ATCC No. 45097. EcoRI site at pos. 1 cuts the PreSl 
region. HBs(S) ORF: 296-964. PreS2 starts at 116; PreSl starts at 3039, 

ATGAAAAA.TCAGACTTTTCATCTCCAGGGGTTCGTAGACGGATTACGAGACTTGACAACAAC 
GGAACGCCAACACAATGCCTATGGAGATCCTTTTACAACACTAAGCCCTGCGGTTCCTACTG 
TATCCACCATATTGTCTCCTCCCTCGACGACTGGGGACCCTGCACTGTCACCGGAGATGTCA 
CCATCAAGTCTCCTAGGACTC CTCGC AGGATTACAGGTGGTGTATTT CTTGTGGACAAAAAT 
CCTAACAATAGCTCAGAATCTAGATTGGTGGTGGACTTCTCTCAGTTTTCCAGGGGGCATAC 
CAGAGTGCACTGGCCAAAATTCGCAGTTCCAAACTTGCAAACACTTGCCAACCTCCTGTCCA 
CCAACTTGCAATGGCTTTCGTTGGATGTATCTGCGGCGTTTTATCATATACCTATTAGTCCT 
GCTGCTGTGCCTCATCTTCTTGTTGGTTCTCCTGGACTGGAAAGGTTTAATACCTGTCTGTC 
CTCTTCAACCCACAACAGAAACAACAGTCAATTGCAGACAATGCACAATCTCTGCACAAGAC 
ATGTATACTCCTCCTTACTGTTGTTGTTTAAAACCTACGGCAGGAAATTGCACTTGTTGGCC 
CATCCCTTCATCATGGGCTTTAGGAAATTACCTATGGGAGTGGGCCTTAGCCCGTTTCTCTT 
GGCTCAATTTACTAGTGCCCTTGCTTCAATGGTTAGGAGGAATTTCCCTCATTGCGTGGTTT 
TTGCTTATATGGATGATTTGGTTTTGGGGGCCCGCACTTCTGAGCATCTTACCGCCATTTAT 
TCCCATATTTGTTCTGTTTTTCTTGATTTGGGTATACATTTGA 

Forward WHV PCR Primer (with added EcoRI site) (SEQ ID NO: 16); 
5 ' - CCGGGAATTCTCATCTCCAGGGGTTCGTAGACGGATTACG- 3 ' 



Reverse WHV PCR Primer (with added Kpnl site) (SEQ ID NO: 17): 
5 ' - CCGGGGTACCCAAATCAAGAAAAACAGAACAAATATGGG- 3 ' 



WHV Surface Antigen Protein (SEQ ID NO: 18): 

MKNQTFHLQGFVDGLRDLTTTERQHNAYGDPFTTLSPAVPTVSTILSPPSTTGDPALSPEMS 
PSSLLGLLAGLQVVYFLWTKILTIAQNLDWWWTSLSFPGGIPECTGQNSQFQTCKHLPTSCP 
PTCNGFRWMYLRRFI I YLLVLLLCLI FLLVLLDWKGLI PVCPLQPTTETTVNCRQCTI SAQD 
MYTPPYCCCLKPTAGNCTCWPIPSSWALGNYLWEWALiARFSWLNLLVPLLQWLGGISLIAWF 
LLIWMIWFWGPALLSILPPFIPIFVLFFLIWVYIZ 
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Primers for PCR amplification of the large fragments containing the S2S coding 
sequences (primers are about 200 bp outside the desired sequences) 

AYWSHFOR (SEQ ID NO: 21 ) 

5 ' -GCCGGCAGGAAGGAAATGGG-3 ' (13GC + 7 AT = Tm 66) 



AYWSHREV ( SEQ ID NO: 22) 

5 ' -CTGCTATTGTCTTCCCAATCCTC-3 ' (11GC + 12AT = Tm 68) 



WHVSHFOR (SEQ ID NO: 23) 

5 ' -CGGGACATACCACGTGGTTTAG-3 ' (12GC + 10AT = Tm 68) 



WHVSHREV (SEQ ID NO: 24) 

5 ' -GGCATTAAAGCAGCGTATCCAC- 3 ' (11GC + HAT = Tm 66) 



Figure 18 
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Figure 20 
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